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Puge strides have been made in the devel- 
Opment of machine-learning algorithms to 
generate what is commonly called artificial 
intelligence (AI). Looking to the forefront of 
how AI is being used in science and society 
reveals many benefits, as well as grand chal- 
lenges, that must be addressed. 
AI offers considerable benefits, particu- 
“larly when applications involve substantial 
epunts of data and high variability. For example, 
achine learning is being used to facilitate under- 
df standing of animal communication, which aims to aid 
conservation and husbandry. Applications in drug de- 
velopment and chemistry could substantially increase 
discovery and implementation. Moreover, there has 
been widespread development and deployment of ma- 
chine-learning algorithms in medicine, in the form of 
medical robots to improve patient treatment and re- 
covery and as platforms that facilitate diagnoses and 
treatment recommendations. But there are concerns 
about bias and discrimination. Finding ways to devel- 
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op machine learning without bias and developing AI 
models that can assess bias and health equity could 
help maximize the benefits for all. 

Such checks and balances are also needed as genera- 
tive AI is deployed. Is an artifact still “art” if produced 
by a machine? And what of copyright protection as 
generative algorithms draw from the work of others? 
There are also questions about retaining employment 
and the value of human skills built over years—might 
those who are most successful in the future be well 
practiced at using AI models, rather than having a tra- 
ditional skill honed over years? Concerns about the ex- 
istential threat of AI systems, to ensure that they are 
safe for democracy, social justice, and human rights, 
also need to be tackled and in a transparent manner. 

What will a machine-intelligent world of the future 
look like? AI cannot do everything—yet. It cannot smell, 
it cannot work in a team, it cannot feel emotions, it 
cannot invent, and it cannot reason. But it can help us 
reach further. The question is, how can we ensure that 
Al is safe so that the benefits outweigh the risks? 

10.1126/science.adj4843 
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EDITORIAL 


Al safety on whose terms? 


apid, widespread adoption of the latest large 
language models has sparked both excitement 
and concern about advanced artificial intelli- 
gence (AI). In response, many are looking to 
the field of AI safety for answers. Major AI 
companies are purportedly investing heavily 
in this young research program, even as they 
cut “trust and safety” teams addressing harms from 
current systems. Governments are taking notice too. 
The United Kingdom just invested £100 million in a 
new “Foundation Model Taskforce” and plans an AI 
safety summit this year. And yet, as research priori- 
ties are being set, it is already clear that the prevail- 
ing technical agenda for AI safety is inadequate to 
address critical questions. Only a sociotechnical ap- 
proach can truly limit current and potential dangers 
of advanced AI. 

Why safety? One could view the shift to safety 
with cynicism. Big Tech, weary from bad publicity, is 
seizing the chance to be viewed as saviors from al- 
gorithmic harms, not perpetrators 
of them. Sociotechnical approaches 
recognize and reject “safety-wash- 
ing’—giving lip service to safe AI 
systems, without requisite commit- 
ments and practices to ensure this 
is the case—and call for transpar- 
ency and accountability to keep 
companies honest. 

What does it mean to make AI systems safe, and 
what values and approaches must be applied to do 
so? Is it about “alignment,” ensuring that deployment 
of AI complies with some designers’ intent? Or is it 
solely about preventing the destruction of humanity 
by advanced AI? These goals are clearly insufficient. 
An AI system capable of annihilating humankind, 
even if we managed to prevent it from doing so, would 
still be among the most powerful technologies ever 
created and would need to abide by a much richer 
set of values and intentions. And long before such 
powerful “rogue” AI systems are built, many others 
will be made that people will use dangerously in their 
self-interest. Years of sociotechnical research show 
that advanced digital technologies, left unchecked, 
are used to pursue power and profit at the expense of 
human rights, social justice, and democracy. Making 
advanced AI safe means understanding and mitigat- 
ing risks to those values, too. And a sociotechnical 
approach emphasizes that no group of experts (es- 
pecially not technologists alone) should unilaterally 
decide what risks count, what harms matter, and to 
which values safe AI should be aligned. Making AI 
safe will require urgent public debate on all of these 


“We are making 


a familiar error.” 


questions and on whether we should be trying to 
build so-called “god-like” AI systems at all. 

How should AI systems be made safe? Narrowly 
technical approaches are not enough. Instead, Al’s use 
must be targeted in the context of broader sociotech- 
nical systems in which it is always embedded. This 
means considering the political economy of AI and 
recognizing when over-indexing on hypothetical fu- 
ture harms predictably risks entrenching the power of 
a few leading companies, leaving major current chal- 
lenges unaddressed. It means emphasizing the scarce 
environmental resources, expropriated data, and ex- 
ploited labor used to make advanced AI systems. It 
means considering how advanced AI will be used—for 
example, to further authoritarianism and illiberalism 
through mass surveillance and manipulation—which 
necessitates understanding people and societies, not 
just their technologies. And it means prioritizing em- 
pirically grounded work on actual AI systems and the 
risks they pose, rather than the (perhaps impossible) 
task of designing technical mitiga- 
tions for risks from systems that do 
not yet exist. 

Safety on whose terms? The field 
of narrow technical AI safety lacks 
ideological and demographic diver- 
sity; it is a near-monoculture and 
therefore inadequate to the intel- 
lectual breadth and rigor required 
of its mission. Its practitioners, disproportionately 
white, male, and advantaged, are often drawn from 
the Silicon Valley social movements of “Effective Al- 
truism” and “Rationalism.” A sociotechnical approach 
resists these structural imbalances, creating spaces 
for the wider participation necessary to steer our 
technological future on the basis of equal concern and 
common humanity. As well as a moral imperative, this 
is essential for understanding AI risks and building 
effective solution sets. Women and people of color 
researchers and advocates have led the way in both 
revealing the harms of new technologies and in miti- 


gating them; their exclusion from AI safety delegiti- . 


mizes that field. Sociotechnical AI safety shifts power 
away from the monoculture. 

We are making a familiar error. Faced with dis- 
orienting technological change, people instinctively 
turn to technologists for solutions. But the impacts 
of advanced AI cannot be mitigated through technical 
means alone; solutions that do not include broader 
societal insight will only compound AI’s dangers. To 
really be safe, society needs a sociotechnical approach 
to AI safety. 

—-Seth Lazar and Alondra Nelson 


10.1126/science.adi8982 
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Refik Anadol’s 
Unsupervised creates 
imagery inspired by 
works at the Museum 
of Modern Art that 
mesmerizes viewers. 


Art in the age of artificial intelligence 


There is more to art than Al-created artifacts, but computational creativity is worth pursuing ° 


By Joanna Zylinska 


fter being expelled from the Royal 
Academy of Fine Arts in Madrid just 
before graduation in 1926 for insult- 
ing the examination panel, 22-year- 
old Salvador Dali headed for Paris. 
This change of location proved trans- 
formative, both for the young artist and for 
art itself. In France, Dali went from imitating 
his masters to developing his own distinctive 
style. Merging imagery inspired by Sigmund 
Freud’s interpretations of erotic dreams with 
surreal hallucinations, Dali embraced as- 
sociation and automaticity as his creative 
method. This shift led to an exuberant pro- 
duction of canvases and other artifacts, an 
energetic pace that he maintained until his 
death in 1989. It is therefore highly apt that 
when, in 2021, OpenAI launched its deep- 
learning model capable of generating novel 
images from natural-language descriptions, 
it called the program “DALL-E” 
A tribute to the famous surrealist artist as 
well as to Pixar’s animated robot WALL-E, 
the current version of the model—known as 
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DALL-E 2—features automaticity, genera- 
tivity, and a dreamy aesthetic. Made widely 
available, DALL-E 2 and its fellow image 
generators Midjourney and Stable Diffusion 
have evoked fascination and horror in both 
the art community and the general public. 
And understandably so, given that such 
models have the potential to be more trans- 
formational than even Dali. 

Artificial intelligence (AI)-based image 
generators have raised the prospect of the 
emergence not just of a radically new art 
movement but of the end of art itself—and 
especially of the artist as individual human 
genius. Their arrival has also posed a fun- 
damental philosophical question: Can ma- 
chines be truly creative (1)? Salvador Dali’s 
artistic trajectory can help us find some an- 
swers, illustrating rather poignantly the key 
issues that underpin AI art, or, to give the 
phenomenon its proper name, multisensory 
computational creativity enabled by ma- 
chine learning (ML). 


CODIFYING CREATIVITY 

One precursor to Dali’s artistic shift was his 
adoption of a “style transfer” through which 
he painted first like a Renaissance master 
and then like Picasso. This same imitative 
method is presently employed in projects 


such as Microsoft’s The Next Rembrandt. 
Throughout his career, Dali mined exist- 
ing repositories of art history resources and 
used what he found there as inspiration to 
produce strikingly new artifacts. In effect, 
the Spanish artist adopted a “combinato- 
rial” method, remixing earlier styles and 
tropes to arrive at something that looked 
truly original to his contemporaries. With 
this approach, Dali put into practice an 
understanding of creativity as “the ability 
to come up with ideas or artefacts that are 


new, surprising, and valuable” that was . 


codified nearly a century later by computer 
scientist Margaret Boden in her celebrated 
book Creativity and Art (2). 

Boden’s definition is itself novel because 
it allows for the possibility that creativity 
may be exercised by both humans and ma- 
chines, thus opening the door to defining 
art in a way that exceeds its humanist leg- 
acy. Her understanding of creativity can be 
applied to both Dali and DALL-E if we rec- 
ognize humans’ continuous evolution with 
and through technologies, from stone tools 
and weapons, through industrial machines, 
to digital networks—an idea articulated by 
philosopher Bernard Stiegler (3). 

It is worth remembering that the anxi- 
ety evoked by ML technology in relation to 
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art has historical precedence. In the early 
1820s, for example, it was feared that the 
invention of photography would lead to the 
death of painting. Instead, photography 
generated an explosion of new ways to see 
and create images—including painted ones. 


AVERAGED DATA, AVERAGE ARTIFACTS 

Novelty and exuberance aside, the first 
phase of AI art enabled by ML was pre- 
mised on an engineering-based approach 
to creativity—and on quite a conservative 
idea of art. Art in this context amounted to 
little more than the averaging of possibili- 
ties, as evidenced in the works of computer 
science-trained creators such as Mike Tyka, 
Mario Klingemann, and Memo Akten. 

Deploying generative adversarial net- 
work (GAN) technology, their images ex- 
plored the visual effects that arise when 
an ML model is trained on a database of 
photographs of human faces or historical 
artworks and made to produce their ap- 
proximations. Divergences from the source 
material, obtained through manipulating 
the learning algorithm and an inevitable 
element of surprise, led models to create 
hallucinogenic images of still or morph- 
ing faces in the style of Francis Bacon, 
with a sprinkling of Dali’s “melting clock” 
aesthetic. The result was thus not just the 
averaging of data but also the production 
of rather average artifacts, albeit ones en- 
veloped in narratives about AI’s supposed 
revolutionary potential. 

How should we think about the qualities 
of these new artifacts deemed “art,” and by 
whom should judgments about them be 
made? First, we must recognize that there 
is a history to art that must be approached 
critically. Art-making is not merely the pro- 
duction of “beautiful” artifacts; the very 
notion of beauty is itself historical and 
contextual. In addition, art has recipients 
to whom its various meanings matter, con- 
tested as they may be. In some iterations, 
art also has financial value, is treated as 
an investment, and functions as part of a 
global exchange of capital. 


PERFORMING CREATIVE POSSIBILITIES 
Artist Refik Anadol’s Unsupervised (2021- 
2023) takes the GAN aesthetic and logic to an 
extreme—but it also opens up some intrigu- 
ing visual and conceptual avenues. For the 
project, Anadol trained an ML model on a 
database of works from the Museum of Mod- 
ern Art’s collection, which includes pieces 
created over a span of 200 years. The result- 
ing display, shown on a huge vertical screen, 
offers mesmerizing sequences of morphic 
images that hint at its data source while cre- 
ating some new connections between, and 
reworkings of, the original pieces. 
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Ungenerous readings have compared 
this project to a screensaver or a lava lamp, 
but acerbic reviews do not convey the pal- 
pable pleasure experienced by audiences 
standing in front of Anadol’s work. View- 
ers are often visibly affected by the display, 
moving their bodies with the undulations 
of the screen images. 

Unsupervised manifests deep existential 
undertones, unwittingly enacting a propo- 
sition by the cybernetics-inspired philos- 
opher of technology Vilém Flusser about 
image-making being always also a function 
of the apparatus, and not just of the dis- 
crete human mind (4). Acknowledging that 
his work is the outcome of a collaboration 
between human and machine, Anadol ex- 
plains: “With the same data, we can gener- 


The surprising amalgamations that appear in 
Unsupervised invite audiences to reflect on a potential 
role for machine learning in the creative process. 


ate infinite versions of the same sculpture, 
but choosing this moment, and creating 
this moment in time and space, is the mo- 
ment of creation” (5). 

Because of the computational power 
available today, infinite variations can be 
enacted for us by machines. The force of 
Anadol’s work therefore lies not only in its 
mesmerizing visuality but also in its perfor- 
mance of the possibilities within the latent 
space of the ML model. 


Al ART ENABLES REFLECTION 

To say that machines can simulate intelli- 
gence, identify patterns, generate novel out- 
puts, or cocreate work with humans is not to 
suggest their sentience or sapience. Rather 
it is to recognize that what humans see as 
intelligent behavior, patterns, and outputs 
is conditioned by both our cortico-corporeal 
apparatus and by our environment. By re- 
turning our values and meanings back to us 
for further reflection, machines can help us 
see ourselves as embodied beings who exist 


in a sociocultural context. 

Many artists working today engage in 
this kind of second-level inquiry into the 
conditions of artistic and technological cre- 
ation with AI. These include studies of algo- 
rithmic bias in the works of Jake Elwes and 
Murad Khan as well as Katja Novitskova’s 
exploration of AI art as “art for another in- 
telligence” (6). 


POLICY, NOT PANIC, WILL HELP ARTISTS 
What lies beneath current questions about 
machinic creativity is a deeper anxiety 
about the ways humans can continue to 
be creative, for how much longer, and at 
what cost. Given the outpouring of visual, 
textual, and sonic artifacts enabled by AI, 
we have begun to ponder the future of hu- 
man creative professions and pastimes. 
These developments raise questions about 
labor, about how we value artifacts and . 
institutions that enable the experience of 
art, and about art education. Even though 
artists have always borrowed, copied, and 
remixed material to produce their wares, 
the plundering of cultural repositories, 
without due compensation or, indeed, rec- 
ognition of individual creators, for the sake . 
of the construction of AI training models 
needs to be interrogated, for reasons of 
moral and economic justice, if not human- 
ist panic. 

More ominously, however, reducing art to 
the production of artifacts, in the way hype 
around “generative AI” has managed to do, 
could be read as part of a deeper agenda; 
one concerned with devaluing any form of 
human cultural activity that cannot be eas- 
ily discretized and monetized. If we want 
to live in a world in which there is room 
for future Dalis and not just new iterations 
of DALL-E, we need less moral panic about *‘ 
“creative AI” and more sensible policy work ° 
around creative education, art institutions, 
and funding models that ensures such prac- 
tices are not governed only by the logic of + 
the financial markets or the optimization 
efforts of Big Tech. 
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PERSPECTIVE 


Artificial intelligence meets medical robotics 


rtificial intelligence (AI) applications in medical robots are bringing a new era to medicine. Advanced 
medical robots can perform diagnostic and surgical procedures, aid rehabilitation, and provide symbiotic 
prosthetics to replace limbs. The technology used in these devices, including computer vision, medical 
image analysis, haptics, navigation, precise manipulation, and machine learning (ML), could allow 
autonomous robots to carry out diagnostic imaging, remote surgery, surgical subtasks, or even entire 
surgical procedures. Moreover, AI in rehabilitation devices and advanced prosthetics can provide 

individualized support, as well as improved functionality and mobility (see the figure). The combination 

of extraordinary advances in robotics, medicine, materials science, and computing could bring safer, more 

efficient, and more widely available patient care in the future. -Gemma K. Alderton 


Leveraging Al for medical 
image-guided robotics 


By Michael Yip! and Septimiu Salcudean? 


Medical image-guided robotics combines medical images, where 
locations of key anatomy, lesions, and objects can be identified, 
and robotics, where the precision placement of instruments or 
tools provides substantial advantages. Frequently leveraged imag- 
ing modalities include ultrasound, magnetic resonance imaging, 
computed tomography, and white-light or fluorescent endoscopy. 
The robot may be used to assist in imaging anatomy, or the imag- 
ing may assist the robot in guiding it to key targets. 

Many early use cases for AI in medical image-guided robot- 
ics involved steering instruments, typically needles, to identify 
anatomy for biopsies. The AI focus was typically on the steering 
mechanics and planning algorithms because traversing soft tissue 
involved curvilinear paths that were constrained by the minimum 
radius of curvature through the tissue, as well as tissue displace- 
ments during instrument insertion. Although challenges persist 
with solving navigation for minimally invasive robotic tools with 
multiple and intermittent contacts with tissue, a large scope of 
solutions involving AI-driven planning for robotic steering of 
instruments are now available (J, 2), and highly dexterous robotic 
systems have been proposed that can plan and reach targets with 
the aid of AI (3). 

Most effort now concerns image understanding. Previous tech- 
niques used tedious, human-segmented (hand-annotated) anatom- 
ical features and/or weak, feature-based recognition of anatomy of 
interest. New AI strategies for image guidance leverages semantic 
information—i.e., higher-level reasoning about the type of anatomy 
and its characteristics, identified directly from pixel information, 
to provide improved, safer navigation. As with the considerable 
improvements in object localization and scene segmentation in 
computer vision, these techniques have begun their translation to 
surgical scenes and surgically relevant objects (4). These translated 


techniques can then be leveraged by robots to plan and reach 
targets with high accuracy. 

An interesting use of image understanding is in image acquisi- 
tion itself (5). For example, robot assistance makes medical ultra- 
sound more consistent, enables autonomous scanning, and may 
improve access to examinations in remote and underserved com- 
munities. Moving the ultrasound transducer against the patient to 
identify the standard imaging planes used in medical diagnostics 
is a challenging AI problem involving deep learning networks 
to predict necessary probe motions (6), reinforcement learning 
(RL), and learning by demonstration (7). Robotic assistance is 
also used in endoscopy, where rigid or flexible endoscopes, and 
even capsules with magnetic actuation, are deployed in surgery, 
gastrointestinal tract imaging, and bronchoscopy (8). Maneuvering 
endoscopes is challenging and requires considerable experience 
to be mastered, making automatic motions attractive. Approaches 
based on learning by demonstration are promising, e.g., by using 
behavioral cloning (6). AI techniques are used in endoscope local- 
ization and mapping and in the analysis of the very large image 
datasets involved in diagnosis (9). 

Intraoperative ultrasound and x-ray imaging enable the registra- 
tion of preoperative medical images to the patients for biopsy and 
surgery. Localizing targets identified through intraoperative regis- 
tration in the corresponding laparoscopic camera views remains a 
challenge. Maintaining such registration requires long-term tissue 
tracking using computer vision techniques, in the presence of tis- 
sue changes as the intervention progresses (4). 

Unlike building neural network models for everyday scenes, 
finding labeled data for training models for medical robotic ap- 
plications is a substantial bottleneck. Accurate labeling must be 
done by trained professionals such as surgeons and radiologists. 
Thus, getting ground-truth data is very expensive and not scal- 
able. Synthetically generated images help to address a part of this 
problem, but synthetic images are sufficiently different from real 
images and can lead to overfitting (10). Given the related chal- 
lenge of human labeling of intraoperative video sequences to train 
neural networks, unsupervised or weakly supervised approaches 
are desirable (1). 
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Smart medical robots 


Various applications of artificial intelligence (Al), including 
machine learning, machine vision, and haptic control, 
have resulted in the development of robotic devices that 
can be used in all aspects of patient care, including 
diagnostics, surgical procedures, rehabilitation, and 
limb replacement. The use of robotics in medicine 

aims to ensure consistent, safe, and efficien 
treatment, as well as allowing data gathering for 
improvement and potentially increasing access 

to treatment in underserved communities and remote 
regions, and to those affected by natural disasters. 


> 


Surgical robots 
Various levels of autonomy can be used in robots 
that carry out surgical tasks. - 


Da Vinci robot teleoperation 


Image-guided robotics can aid diagnostics and delivery of 


Soft robotic device 


Rehabilitation devices and advanced prosthetics 
Wearable devices can improve recovery from injury and facilitate human-in-the-loop intervention. 


Soft exosuits 

Wearable robots in the form of 
soft exosuits can autonomously 
facilitate rehabilitation while 
capturing data to improve 

gait training and personalized 
performance monitoring. 
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Upper limb prosthetics 
Recognizing user intent is important 
for upper limb prosthetics to, 

e.g., modulate hand grasping force 
and appropriate posture. This 
ensures human-prosthesis symbiosis 
to achieve robust control. 


interventions through analyzing medical images. Moreover, 
autonomous robots are being developed to carry out imaging, 
e.g., ultrasound imaging to access standard imaging planes 
consistently. Image guidance is also important for autonomous 
surgical planning and procedures such as endoscopy. 


Smart Tissue Autonomous Robot 


(STAR) for Suturing 


Surgical robots can take various forms, including teleoperated 
devices that allow surgeons to carry out complex procedures without 
fear of tissue damage from hand tremor. Soft robotic devices 

are under development for minimally invasive surgical procedures, 
providing haptic feedback to surgeons as well as ensuring safe 
manipulation of and navigation through soft tissues. Semi- 
autonomous robots that can undertake surgical subtasks, such 

as suturing and debridement, are also under development. 

This could potentially lead to fully autonomous surgical robots. 


Lower limb prosthetics 
Replacement of lower limbs 
requires prosthetics to 
adapt to the environment, 
e.g., through machine vision, 
to ensure stable and 
intentional limb motion. 
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Ultimately, the combination of new techniques in data-efficient 
learning, coordinated efforts in growing and retaining labeled 
medical image datasets, and advances in safe and dexterous 
robotic systems will push the relevance of medical image-guided 
robotics to the forefront of the next generation of interventional 
and diagnostic care. 


Supervised autonomy in robot- 
assisted surgery and telesurgery 


By Ken Goldberg? 


Each year, more than a million surgeries are performed with 
robots (12). These robots are very sophisticated but not at all 
autonomous—they are fully controlled by human surgeons. This 
is because surgery is fault-intolerant, there are a vast number of 
rare but potentially dangerous edge conditions, and the conse- 
quences of even a single failure can be fatal. It may be a long time 
before fully autonomous robots are sufficiently safe and reliable 
for the clinic. In the meantime, substantial progress is being made 
in the use of “supervised autonomy,’ where specific subtasks are 
performed by a robot under close supervision of a human, who is 
ready to take over when necessary (13, 14). 

Consider a surgical subtask such as debridement: the removal of 
damaged tissues or foreign fragments from a wound. Debridement 
can be very tedious; it is easy for a surgeon to overlook fragments, 
which can lead to infections. Debridement is a surgical task that 
could benefit greatly from supervised autonomy, whereby a surgi- 
cal robot and camera system could systematically identify and 
remove fragments under close supervision of the surgeon who is 
ready to take over if the system misidentifies a miscolored human 
tissue as a foreign fragment. Supervised autonomy for surgical 
debridement has been achieved in laboratory conditions (J5, 16), 
but research is still needed to extend these results and evaluate 
them in vivo. 

Another example is surgical suturing. This surgical subtask is 
often left to medical residents because it is tedious and some- 
what fault-tolerant. Surgical suturing requires even placement 
of sutures that balance tissue forces. Supervised autonomy could 
produce sutures that are more consistent, thus reducing healing 
time and scarring. 

Researchers are studying how suturing could be performed us- 
ing supervised autonomy, in which the surgeon can outline 
a wound by touching its boundaries with an instrument, and the 
system computes and displays an optimal placement of needle 
entry and exit points that evenly distributes tension across the 
wound. The surgeon can then adjust these points or allow the ro- 
bot to autonomously perform the suturing under close supervision. 
Researchers have demonstrated initial results in the laboratory 
(17, 18), but securely gripping surgical needles during insertion, 
handing needles back and forth between gripper tools, and manag- 
ing the slack in surgical thread must be addressed before suturing 
can be considered for testing in the clinic. 

Supervised autonomy also opens the door to “telesurgery,” 
whereby an experienced surgical expert can guide a surgery at a 
distant location. Telesurgery has the potential to considerably 
increase access to skilled surgeons in remote regions or during a 
natural disaster (19). Direct control of all surgical instrument mo- 
tions by the expert is not possible because of the inherent time de- 
lays of optical and electrical signals, which cause any direct control 
loop to be unstable. Supervised autonomy can solve this problem 
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by allowing surgical subtasks to be locally controlled, with inter- 
mittent remote supervision. Telesurgery has been demonstrated 
with the use of dedicated high-speed fiber optic networks and is 
being actively explored by researchers, but it is not yet approved 
for clinical use. 

Advances in sensors and ML have evolved robotics considerably 
in recent years, and there are many opportunities for AI in the 
operating room. AI can be used to enhance digital camera images 
and answer spoken queries from a surgeon during a procedure. 
But supervised autonomy offers the greatest potential for reducing 
tedium and improving patient outcomes. Researchers worldwide 
are exploring how supervised autonomy can be used to enhance 
robot-assisted surgery and telesurgery (20). As the original patents 
on surgical robotics expire (27), new commercial surgical robot 
systems are increasing the diversity of hardware and interfaces. 
This competition is motivating the commercial development of 
new functionality, and it is likely that supervised autonomy will be 
available in the clinic in the coming decade. 


Soft robotics for minimally 
invasive surgery 


By Kaspar Althoefer* and Arianna Menciassi® 


Over the past few decades, there have been considerable advance- 
ments in robot-assisted minimally invasive surgery (RAMIS). RAMIS 
systems use slender, straight-line instruments to operate through 
small incisions in the patient’s skin. Robotics makes procedures 
simpler, filtering the manual tremor of surgeons, improving overall 
ergonomics, and restoring three-dimensional (3D) vision that is nor- 
mally not possible in manual minimally invasive procedures. In ad- 
dition, RAMIS allows the generation of a huge amount of data that 
can be used for improving safety and implementing some autono- 
mous tasks (22). Despite the success of some RAMIS platforms, such 
as the da Vinci Surgical System for prostatectomy and abdominal or 
thoracic surgery, these systems are often limited by their rigid com- 
ponent design, which can make it difficult to access certain areas of 
the body and can lead to tissue injuries. 

Soft robotics is a promising avenue for developing more flex- 
ible and adaptable surgical robots, with the necessary dexterity 
and stiffness modulation to perform surgical procedures safely. 
The key feature of soft robotics is the use of materials that can 
deform, bend, shrink, and change stiffness (23), pushing the 
paradigm of robotic surgery in a safer and softer direction. These 
robots address different body regions, such as the ear, abdomen, 
and thorax, and they can be dedicated both for diagnosis and 
intervention. For example, a fluid-driven soft robotic system was 
developed for increasing patient comfort during ear therapy and 
safely steering a needle to the desired injection site (24). Diag- 
nosis of gastrointestinal tract pathology is also a key application 
for soft robots, because these tissues are flexible, stretchable, and 
often collapsed, requiring a spectrum of soft and stiff working 
modalities (25). A capsule robot for endoscopy that uses ever- 
sion navigation and a soft shape-shifting mechanism has been 
recently demonstrated (26). 

A large-scale project to explore soft robotics for RAMIS was the 
European Union project STIFF-FLOP (stiffness controllable flex- 
ible and learnable manipulator for surgical operations) from 2012 
to 2015 (27). The soft robotic systems that were developed were 
made from biocompatible silicone rubber and pneumatically actu- 
ated, by using new fabrication methods that allow for the creation 
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The Smart Tissue Autonomous Robot (STAR), which is holding a needle driver, needle, and suture in preparation for suturing intestinal tissues, is inspected. 


of reliable structures that are also safe and effective. In addition, 
advanced ML techniques were employed to intuitively teleoperate 
the soft robots in the abdominal cavity of the patient, and haptic 
systems allowed surgeons to discern interactions of the robot with 
the soft tissue environment. 

There remain substantial technical challenges (28). A major 
issue is the lack of precision and accuracy in soft robotic systems. 
In traditional surgical robots, electrical motors are used directly 
or by means of tendons to move the robot’s arms, and effectors are 


made from rigid components that do not deform during operation. 


However, soft robotic systems rely on deformation of the mate- 
rial that the robot is constructed from to achieve movement. The 
resultant motion is more difficult to model and can result in lower 
positional accuracy, which could be a critical concern in surgery 
(29). To overcome this challenge, advanced strategies based 

on AI, ML, and data-driven control that can cope with the highly 
nonlinear motion behavior of soft robots are being developed. 
Recent advances in computer power, computer vision, ML, real- 
time modeling, and simulation can make operation of soft robots 
for surgery possible without cumbersome teleoperation modalities 
and extensive training sessions for surgeons (30). 

Will soft robots for RAMIS replace well-established surgical 
robots, or will soft robotic design rules, relying on morphological 
computation (37), permeate traditional technologies for RAMIS? 
What benefits the patient most needs to drive research in soft 
robotics surgery. 
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Bringing highly autonomous 
surgical robotics to the clinic 


By Justin D. Opfermann®*’ and Axel Krieger®”* 


Autonomous surgical robots are the surgeons of the future. They 
have the power to standardize patient outcomes independent of a 
surgeon’s experience and skill (32), they integrate AI and dexter- 
ous tools to perform tasks with more consistency and accuracy 
than expert surgeons (33), and they can provide essential care in 
environments where no surgeon is available such as human space 
flights (34). Such robots will democratize health care by making 
quality surgery ubiquitous and by minimizing the incidence of cor- 
rective surgery, thereby reducing health care costs. Although most 
systems are not clinically approved today, they will certainly play a 
role in the future. 

Generally, autonomous surgical robots are classified by their 
respective level of autonomy (LoA) and incorporate algorithms 
that are responsible for increased surgical decision-making (35). 
As the level of autonomy increases, so too does the complexity 
of the robot’s role in surgery and the amount of AI integrated 
with the system. For instance, the LoA 0 (no autonomy) da Vinci 
Surgical System uses human teleoperation without AI to perform 
surgery, whereas the LoA 1 (robot assistance) EndoAssist camera 


science.org SCIENCE 


& 


¢ 


A MACHINE-INTELLIGENT WORLD SPECIAL SECTION 


PHOTO: HARVARD BIODESIGN LAB 


holder uses algorithms to restrict tool motions (36). For higher 
levels of autonomy, the surgeon defers control to the robot, which 
uses AI to execute surgical tasks. During LoA 2 (task autonomy), 
a robot may use learning by observation to independently cut 
tissue as with the da Vinci Research Kit (37), whereas the LoA 3 
(conditional autonomy) Smart Tissue Autonomous Robot (STAR) 
uses ML to track soft tissue deformation to execute the surgical 
plan of suturing (38). The LoA 4 (high autonomy) and LoA 5 (full 
autonomy) surgical systems are not yet feasible with today’s tech- 
nology, but their development is on the horizon. 

There remain several technical, regulatory, and social chal- 
lenges to solve before the highest levels of autonomy can be 
reached. Robots will need to better detect, process, and respond 
to unpredictable variations in the surgical field. These challenges 
will be magnified in soft tissue surgeries, where deep learning 
and AI will be necessary to predict, and react to, the changing 
surgical scene (39). The resulting technologies will face increased 
regulatory scrutiny because it is not clear how a system capable 
of practicing medicine would be regulated by the US Food and 
Drug Administration (FDA) or medical device regulators else- 
where. These systems will need to demonstrate safety that exceeds 
teleoperation and efficacy as good as the clinical standard of care. 
For AI algorithms, this means sensitivity and specificity at least as 
good as an expert surgeon’s. Social pressures might also temper 
adoption as there is general resistance with using AI in medicine 
(40). To build public trust, autonomous robots will likely follow 
a stepwise approach to adoption, in which autonomous subtasks 
such as tissue identification, endoscope control, and suturing will 
be gradually introduced. These tasks will then be combined into a 
full procedure, paving the way for autonomous robotic surgery in 
the operating room of tomorrow. 


Rehabilitation robots to go 


By Krithika Swaminathan? and Conor J. Walsh’® 


Rehabilitation needs to extend from the clinic into the community 
and home to provide patients with a continuum of care. Toward 
this goal, engineers, clinicians, and end users have developed 
wearable robots that allow people with mobility impairments to 
practice and experience better movement. Although these systems 
have historically been seen as assistive technologies, ongoing work 
is demonstrating that these portable and autonomous robots, and 
the data that they capture, can lead to fundamentally different 
rehabilitation approaches. One can imagine a future in which 
wearable robots are used in the clinic to reduce the physical bur- 
den on clinicians while learning patient-specific impairments, and 
are then sent to the patient’s home to track their recovery with 
individualized ML algorithms. 

Recent technological advances in hardware and software are 
facilitating this shift to real-world use (41-44). Innovative ap- 
proaches for actuation hardware achieve higher specific powers 
(power per unit mass) through cable-driven (45) or pneumati- 
cally-driven mechanisms (46). Soft, apparel-based interfaces can 
be lightweight and nonrestrictive to the user and let a device 
smoothly transition between assisting motion, being mechani- 
cally transparent, and resisting motion for strength training. The 
increased modularity of these systems also allows for tailoring the 
technology to individual-specific impairments. Along with new 
mechanical designs, learning-based estimation algorithms now 
use wearable sensors to detect and quantify movements (47), and 
control strategies now bring users into the control loop to provide 
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individualized intervention (48). These techniques further enable 
studying the rehabilitative potential of high-dosage and high-in- 
tensity training beyond the clinic (49). 

Wearable cloud-connected robots will usher in the era of 
data-driven physical telerehabilitation. Integrated sensing can pro- 
vide clinicians and users with feedback on important biomechan- 
ical and physiological metrics, similar to smart fitness trackers. 
However, optimally combining this feedback with clinical input to 
guide motor learning and encourage behavioral change remains an 
open challenge. Moreover, developing generalized data-driven ML 
and AI algorithms in clinical populations is particularly complex 
owing to the limited data available from any given individual and 
the large variability across individuals. Continuous movement as- 
sessments also increase the temporal resolution of characterizing a 
patient’s progression, enabling more timely and accurate detection 
of performance degradation or improvement. 

However, a challenge remains in developing estimation algo- 
rithms that are robust to noise across recovery timescales (months 
to years), owing to sensor drift, sensor placement, environmental 
changes, and day-to-day user variability. Validating these ap- 
proaches is further complicated by the lack of ground-truth data z 
in real-world settings. If successful, the corresponding data from 
the patients and robots can be used with ML techniques to iden- 
tify who is most likely to benefit from a given device (50). Such 
categorization will enable efficient prescription of interventions, 
minimizing costs for clinicians and patients alike. The field will 
then have new opportunities to investigate how longitudinal data 
can inform new methods for individualization of training parame- 
ters, such as for biofeedback and device controller optimization. 

Data from wearable robots worn in the real-world will inform 
the design of computational models and experimental validation 
of human-robot interaction during rehabilitation, ultimately lead- 
ing to adaptive systems that better synergize with end users. 


A soft robotic, cloud-connected glove facilitates high-dosage and high-intensity 
self-directed teletherapy for stroke survivors in the home. 


14 JULY 2023 + VOL 381 ISSUE 6654 145 


SPECIAL SECTION 


A MACHINE-INTELLIGENT WORLD 


Al enables symbiotic 
robotic prosthetics 


By He (Helen) Huang"? and I-Chieh Lee™2 


Advanced robotic prosthetics, such as dexterous prosthetic hands 
and motorized prosthetic legs, have led to a paradigm shift to 
restore the mobility of individuals with limb loss (57, 52). These 
modern prostheses have embedded AI into machine operation 

to enable adaptation to user intent, environments, and the user’s 
physical condition. This is essential for human-prosthesis symbio- 
sis—an intelligent prosthesis and a human user functioning seam- 
lessly together as one system in daily life (53). 

For example, AI has enabled neural control of prosthetic limbs. 
Controlling prosthetic limbs on the basis of user intent from the 
brain is a fascinating concept. This requires an effective neural 
decoder that can accurately interpret user intent through human 
neuromuscular signals for prosthetic limb control. ML algorithms, 
ranging from simple linear classifiers to deep learning regression 
models, have been powerful neural decoding methods for recogniz- 
ing user intent regarding joint motions (e.g., wrist or knee flexion 
and extension), hand grasping patterns (e.g., fine pinch or power 
grip), or locomotion modes (e.g., sit-to-stand transition and level 
ground walking) (54, 55). The ML decoder output is sent to the 
prosthesis controller to produce the user’s intended limb motion to 
enable human-prosthesis symbiosis during task performance. 

Human-prosthesis symbiosis through the implementation of AI 
allows adaptation to various environments and contexts. Human 
hands can dexterously interact with objects of different sizes and 
materials; human legs can adapt to various terrains while walking. 
Therefore, a symbiotic prosthetic limb should also be environmen- 
tally adaptive. Machine vision has been adopted to create environ- 
mental awareness for prosthesis control. Through deep learning 
algorithms applied to images captured by cameras mounted 
on a prosthetic hand, machine vision can recognize the intended 
grasping object, which allows the prosthesis arm to prepare the 
appropriate wrist posture and hand grasping pattern or force to fa- 
cilitate grasping actions (56, 57). Similarly, vision sensors mounted 
on prosthetic legs can recognize the terrain in front of the user, 
which autonomously adapts prosthesis control accordingly for 
seamless terrain transitions (58, 59). 

A symbiotic prosthesis needs to provide personalized assis- 
tance to each user owing to the large inter-amputee differences 
in their physical conditions and motor deficits. In current clin- 
ics, personalization of robotic lower-limb prosthesis control is 
performed manually and heuristically, which is inaccurate and 
time- and labor-intensive. To automate the process, researchers 
have developed RL algorithms and other data-driven optimiza- 
tion approaches, such as Bayesian optimization, to tune prosthe- 
sis control with the human-in-the-loop for personalized walking 
assistance (60). For RL-based algorithms, the prosthesis personal- 
ization procedure can be as short as 5 minutes, and the resulting 
smart AI tuning agent can continue producing user-adaptive 
control over different time frames (61). 

Although AI in robotic prosthetics has shown great prom- 
ise, AI needs to be more robust and safer for daily prosthesis 
control owing to having the human-in-the-loop. Additionally, it 
is an open question whether human users cognitively embody 
and trust Al-enabled prostheses. These challenges should direct 
future research efforts toward making Al-enabled symbiotic, ro- 
botic prostheses versatile, safe to use, and cognitively acceptable 
by users with limb amputations. 
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On 15 March 2016, 
professional Go player 
Lee Sedol reviews the 
match after being 
defeated by AlphaGo. 


Improving artificial intelligence with games 


Games continue to drive progress in the development of artificial intelligence 


By Peter R. Wurman’, Peter Stone’, 
Michael Spranger? 


or thousands of years, humans have 

played board games as a _ pleasur- 

able way to pass time and match wits. 

Games such as Senet were so prized as 

to be buried with other valuables for 

the afterlife in Egyptian tombs ~3500 
BCE (J). Skill in strategic board games such as 
chess and Go came to represent a pinnacle of 
human intelligence, and thus it is no surprise 
that such board games have been viewed 
as yardsticks by which to measure machine 
intelligence (2). In recent years, computers 
have overtaken human skill in a wide array 
of strategy games. Although it may seem that 
the recent dominance of artificial intelligence 
(AI) in classic strategy games foreshadows 
the end of what games can teach us about AI, 
researchers are just beginning to tackle the 
substantially more complex and varied chal- 
lenges posed by modern video games. 

Games provide safe opportunities to iso- 
late and practice many skills that are trans- 
ferable to the real world. As such, they are 
valuable training grounds not only for people 
but also for intelligent machines. Strategic 
reasoning and problem solving can be 
learned from traditional games such as tic- 
tac-toe, checkers, chess, and Go. Probabilistic 
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reasoning can be learned from games such 
as backgammon, bridge, and poker. Social 
games such as Hanabi and Diplomacy teach 
cooperation and negotiation. Furthermore, 
video games substantially open up the space 
of skills that can be practiced, including run- 
ning businesses (lemonade stands or rail- 
roads), managing growing simulated cities, 
or racing virtual cars. 

Games such as checkers, chess, and Go 
represent the simplest class of games in that 
players have perfect information (all play- 
ers see all of the information); there are no 
hidden or stochastic elements. In theory, the 
game can be “solved” by computing the opti- 
mal play for both players, which would guar- 
antee a win or draw. Checkers (3), Connect 
Four, Gomoku, and nine men’s motris are 
examples of mathematically solved games. 
Chess and Go remain unsolved because of 
their number of possible board states. 

A weaker, but still impressive, demon- 
stration of skill is to beat the best human 
players in the world, as when Deep Blue de- 
feated reigning world chess champion Gary 
Kasparov in 1997 (4). In 2016, AlphaGo dem- 
onstrated similar superhuman performance 
in the game of Go (5). In tackling Go, a new 
stochastic search algorithm called Monte 
Carlo tree search (MCTS) was developed to 
handle large state spaces, which has had ben- 
efits for a variety of real-world optimization 
problems such as multirobot path planning 
and procedural content generation. 

In large games, MCTS alone cannot sam- 


ple enough possible futures to determine the 
value of early and middle game board posi- 
tions accurately. Instead, the search must be 
complemented with good estimates of the 
long-term value of intermediate game states. 
Notably, superhuman performance in Go was 
not achieved until neural networks were em- 
ployed to estimate the value of intermediate 
board positions, which made MCTS more ac- 
curate and efficient. 

Although a computer defeated a sitting 
world backgammon champion in 1979 (6), 
decades before computers won at chess or 
Go, backgammon represents a slightly more 
challenging class of game because of the ad- 
dition of random chance. In games of chance, 
the value of a node in the search tree must 
represent the expected value of that game 
state, that is, a weighted sum over all pos- 
sible ways in which the stochastic future of 
the game might unfold. The backgammon 
program’s performance was a high-profile 
demonstration of the power of using self-play 
and reinforcement learning (RL) to train a 
computer program. 

Games such as poker and Stratego add the 
complexity that the player has imperfect in- 
formation. Although a poker player knows 
their own cards, they do not know the oppo- 
nents’ cards or any shared cards that have not 
been exposed. Furthermore, in poker compe- 
titions, the goal is to win the most money 
over the long term, not just to win individual 
hands. Thus, an AI system must reason about 
not only the opponents’ current cards, but 
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also their long-term betting strategies. In 
game theory terms, it is an example of a re- 
peated game, where the interactions in one 
round affect the beliefs and strategies of the 
players in future rounds. Superhuman per- 
formance has been demonstrated in imper- 
fect information games such as Stratego and 
Texas hold’em no-limit poker (7). However, 
interestingly, bridge remains a rare example 
of a game that AI has not mastered, possibly 
owing to the complex nature of the inter- and 
intrateam communication required to play 
well with a partner. 

In an attempt to study more generally in- 
telligent “agents,” annual competitions were 
held between 2005 and 2016 to motivate 
research in game playing (8). The agents 
were presented with a logical definition of 
a new game in a formal game-description 
language and had to reason about not only 
how to play, but also how to represent the 
game in memory so that it can be searched 
and evaluated. More recently, MuZero (9), a 
single, unified system that could learn to play 
Atari, chess, Go, or shogi, was presented. It 
was followed soon after by Player of Games, 
which expanded the set to include imperfect- 
information games such as poker. 

Although the accomplishments of design- 
ing computer programs that can defeat the 
best players in the world is impressive, strat- 
egy games require a particular kind of intelli- 
gence. Specifically, these successful programs 
use some form of guided deep search and 
state-value estimation. Compared to humans, 
computers are much faster and more thor- 
ough at search, and recently became better 
at state-value estimation by using self-play 
to generate vast amounts of expert-level data 
that is fed into neural networks that serve as 
function approximators. 

By contrast, video games pose new types of 
challenges, many of which drive the develop- 
ment of systems that require more “common- 
sense” intelligence (10). Video games are a 
promising application area for work in gen- 
eral intelligence because they are complex 
simulations of an imaginary world, complete 
with (imaginary) physics and usually popu- 
lated with a variety of other players and non- 
player characters. They entail many of the 
challenges associated with building general- 
purpose robots without the real-world com- 
plexities and costs. Indeed, developing agents 
to play video games allows researchers to 
discover and start to address undesirable 
emergent behaviors in relatively low-stakes 
environments, rather than in business-criti- 
cal applications. 

To date, only a few AI systems have been 
developed for these domains. In the late 
1990s, games such as Quake and Doom were 
used to explore the power of rule-based AI 
systems. Things changed in 2015 when the 
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confluence of progress in image processing 
through neural networks and RL enabled an 
agent to play some classic Atari games (/7). 
Soon after, the suite of Atari games became 
a benchmark domain for AI researchers to 
compare RL algorithms. The Atari suite of 
games also exposed challenging situations, 
such as sparse rewards, that forced research- 
ers to continue to expand the capabilities of 
the techniques. These advances enabled re- 
searchers to tackle more modern and difficult 
video games, including the real-time strat- 
egy games StarCraft IT (12) and DotA 2 (13), 
and the highly realistic racing game Gran 
Turismo (GT) (14). All three of these projects 
culminated with evaluations against top e- 
sports gamers in which they demonstrated 
grand-master or superhuman performance. 
In all of these successes, machine learning 
allowed researchers to achieve performance 
levels that they could not reach with hand- 
coded rules. 

Although much has been learned about 
building Als for these particular games, the 
challenges posed by open-world games such 
as Minecraft, Horizon Forbidden West, or The 
Legend of Zelda: Tears of the Kingdom have 
yet to be conquered. These games require 
AI systems to learn diverse skills and apply 
them across enormous maps with varied ter- 
rain, enemies, and other challenges. They in- 
volve elements of combat, travel, economics, 
and creative puzzle solving. They have com- 
plex storylines, communicated in natural lan- 
guage, that unfold throughout the gameplay 
and require the player to leverage acquired 
skills and knowledge to make progress. The 
techniques needed to build agents for small 
parts of these games are available, and re- 
cent progress in generative AI may help with 
some of these challenges. However, the task 
of building a general-purpose agent architec- 
ture that can learn—especially in a human 
time scale—to play these sorts of games re- 
mains a grand challenge. 

Using video games as platforms for re- 
search in AI directly benefits the USD$200 
billion global video game industry and, 
eventually, could benefit development of the 
metaverse. Producing agents that are capable 
of playing modern video games unlocks the 
possibilities of much more capable, human- 
like nonplayer characters within the games 
themselves. For example, anecdotal evidence 
from online forums regarding the recent, 
time-limited deployment of the racing agent 
(GT Sophy) in Gran Turismo 7 suggests that 
players felt that racing against GT Sophy pro- 
vided an experience that was much more like 
racing other skilled humans than the game’s 
default, hand-coded AI. As the technology 
progresses, the gaming community could 
benefit from more-intelligent agents by using 
them as opponents and teammates in online 


games, as in-game tutors, as automated tes- 
ters to find bugs in games, or by designers to 
ensure game balance. 

In the physical world, games often take the 
form of sports. Research on robots that can 
play sports, such as the ongoing RoboCup 
(15) robotic soccer competitions and the 
autonomous race cars being developed in 
the Indy Autonomous Challenge, simulta- 
neously spur progress on the frontiers of AI 
and robotics. In both examples, much of the 
research occurs in simulations because the 
cost and complexity of keeping real robots 
running preclude them from being used as 
the sole experimental platform. Bridging the 
sim-to-real gap, a concept that describes the 
challenges of moving a policy from simula- 
tion to real robots, has become a topic of re- 
search on these projects. Fortunately, owing 
in large part to decades of efforts to improve 
the graphical and physical realism of video . 
games, the fidelity of robot simulations has 
continued to improve, thereby reducing sim- 
to-real gaps. 

Making progress on AI within video games 
will also be a substantial step toward much 
more capable and flexible AI systems that op- 
erate in the physical world. In contrast to spe- 
cial-purpose robots, general-purpose robots 
need the ability to continually adapt to their 
environments and quickly learn new tasks. 
Modern, open-world video games allow AI re- * 
searchers to focus on fundamental AI topics 
such as multitask learning, life-long learning, 
common-sense reasoning, human-AI collabo- 
ration, and explainability, while leaving the 
challenges raised by real-world sensors and 
actuators to roboticists. As robotic simulators 
become even more realistic, the sim-to-real 
gap will continue to shrink, making it more 
likely that the techniques that work in games 
and simulations will be transferable to the *‘ 
real world. : 


REFERENCES AND NOTES 


1. W.Cristetal.,Ancient Egyptians at Play: Board Games 
Across Borders (Bloomsbury, 2016). 

. N.Wiener, Cybernetics (Wiley, 1948). 

. J. Schaeffer et al., Science 317, 1518 (2007). 

M. Campbell, A. J. Hoane Jr., F. Hsu, Artif. Intell.134, 57 

(2002). 

. D. Silver et al., Nature 529, 484 (2016). 

G. Tesauro, Neural Comput. 6, 215 (1994). 

. N. Brown, T. Sandholm, Science 359, 418 (2018). 

. M.Genesereth, Y. Bjornsson, Al Mag. 34, 107 (2013). 

. J. Schrittwieser et al., Nature 588, 604 (2020). 

. J. Laird etal., Al Mag. 22,15 (2001). 

. V.Mnihetal., Nature 518,529 (2015). 

0. Vinyals et al., Nature 575, 350 (2019). 

M. Hutson, Science 361, 632 (2018). 

P.R.Wurmanet al., Nature 602,223 (2022). 

. H. Kitano et al., Proceedings. of the First International 
Conference on Autonomous Agents (1997), pp. 340-347. 


ABRDBNEOOMYHDH RWHP 


ACKNOWLEDGMENTS 


Allauthors are coauthors on patents and receive stock 
options as employees of Sony. The employment of P.S. at 
Sony has been reviewed and approved by the University of 
Texas at Austin in accordance with its policy on objectivity 
in research. 


10.1126/science.adh8135 


science.org SCIENCE 


PERSPECTIVE - HYPOTHESIS 


Implications of predicting race 
variables from medical images 


Al-predicted race variables pose risks and opportunities 


for studying health disparities 


By James Zou’, Judy Wawira Gichoya?, 
Daniel E. Ho’, Ziad Obermeyer* 


here are now more than 500 US Food 

and Drug Administration (FDA)- 

approved medical artificial intel- 

ligence (AI) devices, and AI is used 

in diverse medical tasks such as as- 

sessing the risk of heart failure and 
diagnosing cancer from images (7). Beyond 
predicting standard diagnoses, AI models 
can infer a substantial number of patient 
features from medical images in ways that 
humans cannot. For example, several stud- 
ies have demonstrated that some AI models 
can infer race variables (crude simplistic 
categories) directly from medical im- 
ages such as chest x-rays and cardiac 
ultrasounds, even though there are 
no known human-readable race cor- 
relates in these images (2-4). This has 
raised concerns about the possibility 
of AI systems to discriminate. At the 
same time, AI predictions of demographic 
attributes, including race variables, also 
have the potential to help assess and moni- 
tor health care disparities and generate new 
insights into risk factors (5). 

Race is a complex and problematic so- 
cial construct that does not have a bio- 
logical basis (6). The concept of race varies 
across time and geography and according 
to social factors. Demographic-prediction 
AI models can impute (or infer) race vari- 
ables that are captured in electronic health 
records. These are simplistic and dated 
reporting categories, such as white, Black, 
and Asian. Despite such limitations, mea- 
sures of health care equity and algorithmic 
fairness often depend on stratifying out- 
comes by race variables. 

How AI predicts race variables is puz- 
zling. It is likely that other demographic 
attributes, comorbidities, or imaging ar- 
tifacts are at play. For example, AI can 
predict patients’ age and sex from cardiac 
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ultrasound videos because heartbeat pat- 
terns vary with age and sex (3). Therefore, 
if patients of one race skew older or 
younger in the training dataset, then the 
model can use this correlation to predict 
race. Measured confounders (variables 
that may cause spurious association) that 
could lead to correlations can be adjusted 
for directly. But in a study of chest x-rays, 
measured potential confounders were bal- 
anced across the dataset, and a deep learn- 
ing model was still able to predict race 
variables from the images, achieving an 
area under the receiver operating charac- 
teristic curve (AUROC) above 0.9 (AUROC 
is acommon metric of prediction accuracy 


“athe ability of Al to predict race variables 
from medical images could be useful 
for monitoring health care disparity...” 


where 1 is perfect) (2). This suggests that 
unmeasured confounders also play a role. 
Moreover, illnesses can be unequally dis- 
tributed across racial groups; if those ill- 
nesses have imaging manifestations, algo- 
rithms could also use them to predict race. 

Could the ability of AI to predict race 
variables from medical images, whatever 
the mechanism, exacerbate health care 
disparities? Although diagnostic AI is 
not trained to directly read out race vari- 
ables, it is possible that race or its corre- 
lates could be a shortcut that is used by 
AI algorithms to make disease diagnoses. 
There are some settings where this is pos- 
sible and potentially harmful. For example, 
suppose the patients of one race are more 
likely to have their images taken by type 
A machines and, because of differences in 
where the machines are deployed, most 
of the positive cases in the training data 
are collected from type B machines. AI can 
learn to use machine type to predict race 
variables and disease status (it is less likely 
to predict disease for images taken from 
machine A). In deployment, the AI would 
then underdiagnose patients of a particu- 
lar race, leading to care disparity. This 
raises serious questions of algorithmic dis- 


crimination, especially if the AI uses cid 
race proxy in its assessment and clinictans— 
are not aware of this. 

Therefore, it is critical to evaluate the 
reliability of AI in truly independent and 
representative datasets in which race vari- 
ables are available—for example, from pa- 
tient self-reporting or linked administra- 
tive data. Ideally, this dataset would come 
from multiple settings or health systems to 
avoid overfitting. Then, equity can be mea- 
sured by disaggregating the model’s per- 
formance by race variables to ensure that it 
does not have excess errors or differential 
performance between groups. 

The ability of AI to pick up hidden phe- 
notypes, such as race variables, from medi- 
cal data could also be a resource for equity. 
AI can generate valuable information for 
auditing datasets to detect biases (7). For 
example, if an AI model can accurately pre- . 
dict attributes that may not be medically 
relevant (such as race, or the machine type 
that took the image), this suggests possible 
confounders in the training dataset. Maybe 
there are spurious correlations involving 
race variables in the data, or parts of the 
hospital have different imaging setups. In 
such cases, these confounders should 
be explicitly accounted for in subse- 
quent analyses. Unexpected predic- 
tions by AI force these data warts 
into the open and could improve 
transparency. 

In many settings, race variables are 
not available, but laws in numerous coun- 
tries require assessment of disparities. 
How can it be ensured that a model, treat- 
ment, or policy benefits a diverse popula- 
tion if the relevant demographic attributes 
are missing? Predicted race variables from 
an AI algorithm could be a useful, albeit 
imprecise, proxy. For example, if research- 
ers develop a method to diagnose Long 
Covid from chest x-ray images, but they do 
not have race metadata for many images ° 
in their dataset, a demographic-prediction 
AI model could impute race variables from 
their images. The model’s performance 
when stratified by race variables can then 
be assessed to ensure that it works well 
across diverse populations. There is prec- 
edent for this imputation approach: The 
US Centers for Medicare and Medicaid 
Services use race inferred from surname 
and location to monitor health care dis- 
parity (8), and the Consumer Financial 
Protection Bureau uses a similar approach 
to monitor equity in lending (9). The 
Internal Revenue Service is statutorily pro- 
hibited from using Census demographic 
data to assess disparities in tax administra- 
tion, and the US Treasury Department has 
in turn relied on imputed race variables to 
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uncover important disparities in tax ex- 
penditures and audits (10). 

The usage of imputed race variables in 
health care should focus on disparity as- 
sessment and algorithmic auditing instead 
of medical decision-making. The accuracy 
of imputations should be carefully vali- 
dated in each new application. When the 
imputed race variables are broad catego- 
ries, downstream analysis may miss dispar- 
ities within subgroups of the population. 
Furthermore, in applications involving 
interpretations of DNA sequencing, other 
population labels such as genetic ancestry 
are more meaningful categories for assess- 
ment than race variables (//). 

AI could help to provide more nuanced 
representations of race than the standard 
reporting categories, which are often very 
coarse. AI produces continuous scores or 
probabilities, not categories. This mirrors 
work in human genetics showing that it 
is more accurate to model populations as 
continuous spectrums of variation in an- 
cestry rather than modeling according to 
a few discrete race labels (12). AI models, 
such as neural networks, also learn to rep- 
resent individuals as points in a continu- 
ous space (often called embedding) (J3). 
Instead of treating, for example, “Asian” as 
a single discrete category, an AI model that 
assesses chest x-rays maps images from 
Asian patients to a continuous geometric 
space before making its diagnosis predic- 
tion. Variation in this space could reflect 
ancestry subgroups within Asians as well 
as differences in health care access, comor- 
bidities, and other unmeasured factors. 
More research is needed to characterize 
the implications for health care from dif- 
ferences between patient embedding. 

AI represents patients on a continuous 
spectrum rather than as discrete racial 
categories. This makes it easier to evaluate 
and ensure that the model makes accurate 
predictions across fine-grained subpopula- 
tions. Instead of measuring how well the 
AI model performs on Asians as one group, 
which can mask limitations on less popu- 
lous subgroups, the model’s performance 
can be assessed on different clusters or 
spectra of patients in the embedding space. 
Suppose the model makes more mistakes 
than expected in a particular data cluster; 
further investigation of what subpopula- 
tion is represented in this error cluster can 
be undertaken, and the model could be 
retrained to improve on this subgroup. In 
response to advocacy by civil rights groups, 
the White House recently proposed updat- 
ing the 1997 coarse race reporting stan- 
dard to foster data disaggregation accord- 
ing to reporting outcomes by subgroups 
(for example, Vietnamese American versus 


150 = 14 JULY 2023 + VOL 381 ISSUE 6654 


Asian American). It may take years for that 
process to materialize in health data, and 
AI imputations can potentially help to un- 
cover health care disparities between gran- 
ular subgroups in the meantime (/4). 

Although race variables are not a gener- 
ally meaningful category in medicine, the 
ability of AI to predict race variables from 
medical images could be useful for monitor- 
ing health care disparity and ensuring that 
algorithms work well across diverse popu- 
lations. Where feasible, researchers can 
try to predict race and other demographic 
variables from their data as a mechanism to 
thoroughly audit their dataset and model. 
If race variables and other attributes can 
be accurately predicted, this could reveal 
potential data or model biases that should 
be addressed. Understanding what features 
AI mechanistically uses to predict race 
variables is important, and AI interpreta- 
tion methods such as concept activation, 
saliency map, and counterfactual explana- 
tions could help. Overall, being transparent 
and deliberate about race variable predic- 
tions should reduce the risk of algorithmic 
discrimination and could generate valuable 
information about the dataset, the model, 
and health care disparities. 
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Mitigating bias 
in Al at the 
point of care 


Promoting equity in Al 
in health care requires 
addressing biases at 
clinical implementation 


By Matthew DeCamp! and 
Charlotta Lindvall? 


rtificial intelligence (AI) shows 
promise for improving basic and 
translational science, medicine, and 
public health, but its success is not 
guaranteed. Numerous examples 
have arisen of racial, ethnic, gen- 
der, disability, and other biases in AI ap- 
plications to health care. In ethics, bias 
generally refers to any systematic, unfair 
favoring of people in terms of how they 
are treated or the outcomes they experi- 
ence. Consensus has emerged among sci- 
entists, ethicists, and policy-makers that 
minimizing bias is a shared responsibility 
among all involved in AI development. For 
example, ensuring equity by eliminating 
bias in AI is a core principle of the World 
Health Organization for governing AI (1). 
But ensuring equity will require more than 
unbiased data and algorithms. It will also 
require reducing biases in how clinicians 
and patients use Al-based algorithms—a 
potentially more challenging task than re- 
ducing biases in algorithms themselves. 
Examination of bias in AI has tended to- 
ward removing bias from datasets, analy- 
ses, or AI development teams. For example, 
because of unequal recruitment and en- 
rollment, oncology datasets demonstrate 
racial, ethnic, and global geographic biases 
(2). In another example, developers as- 
sumed that health care costs were a proxy 
for health care needs, but then learned 
that Black Americans often receive less 
medical care even when they have greater 
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uncover important disparities in tax ex- 
penditures and audits (10). 

The usage of imputed race variables in 
health care should focus on disparity as- 
sessment and algorithmic auditing instead 
of medical decision-making. The accuracy 
of imputations should be carefully vali- 
dated in each new application. When the 
imputed race variables are broad catego- 
ries, downstream analysis may miss dispar- 
ities within subgroups of the population. 
Furthermore, in applications involving 
interpretations of DNA sequencing, other 
population labels such as genetic ancestry 
are more meaningful categories for assess- 
ment than race variables (//). 

AI could help to provide more nuanced 
representations of race than the standard 
reporting categories, which are often very 
coarse. AI produces continuous scores or 
probabilities, not categories. This mirrors 
work in human genetics showing that it 
is more accurate to model populations as 
continuous spectrums of variation in an- 
cestry rather than modeling according to 
a few discrete race labels (12). AI models, 
such as neural networks, also learn to rep- 
resent individuals as points in a continu- 
ous space (often called embedding) (J3). 
Instead of treating, for example, “Asian” as 
a single discrete category, an AI model that 
assesses chest x-rays maps images from 
Asian patients to a continuous geometric 
space before making its diagnosis predic- 
tion. Variation in this space could reflect 
ancestry subgroups within Asians as well 
as differences in health care access, comor- 
bidities, and other unmeasured factors. 
More research is needed to characterize 
the implications for health care from dif- 
ferences between patient embedding. 

AI represents patients on a continuous 
spectrum rather than as discrete racial 
categories. This makes it easier to evaluate 
and ensure that the model makes accurate 
predictions across fine-grained subpopula- 
tions. Instead of measuring how well the 
AI model performs on Asians as one group, 
which can mask limitations on less popu- 
lous subgroups, the model’s performance 
can be assessed on different clusters or 
spectra of patients in the embedding space. 
Suppose the model makes more mistakes 
than expected in a particular data cluster; 
further investigation of what subpopula- 
tion is represented in this error cluster can 
be undertaken, and the model could be 
retrained to improve on this subgroup. In 
response to advocacy by civil rights groups, 
the White House recently proposed updat- 
ing the 1997 coarse race reporting stan- 
dard to foster data disaggregation accord- 
ing to reporting outcomes by subgroups 
(for example, Vietnamese American versus 
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Asian American). It may take years for that 
process to materialize in health data, and 
AI imputations can potentially help to un- 
cover health care disparities between gran- 
ular subgroups in the meantime (/4). 

Although race variables are not a gener- 
ally meaningful category in medicine, the 
ability of AI to predict race variables from 
medical images could be useful for monitor- 
ing health care disparity and ensuring that 
algorithms work well across diverse popu- 
lations. Where feasible, researchers can 
try to predict race and other demographic 
variables from their data as a mechanism to 
thoroughly audit their dataset and model. 
If race variables and other attributes can 
be accurately predicted, this could reveal 
potential data or model biases that should 
be addressed. Understanding what features 
AI mechanistically uses to predict race 
variables is important, and AI interpreta- 
tion methods such as concept activation, 
saliency map, and counterfactual explana- 
tions could help. Overall, being transparent 
and deliberate about race variable predic- 
tions should reduce the risk of algorithmic 
discrimination and could generate valuable 
information about the dataset, the model, 
and health care disparities. 
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rtificial intelligence (AI) shows 
promise for improving basic and 
translational science, medicine, and 
public health, but its success is not 
guaranteed. Numerous examples 
have arisen of racial, ethnic, gen- 
der, disability, and other biases in AI ap- 
plications to health care. In ethics, bias 
generally refers to any systematic, unfair 
favoring of people in terms of how they 
are treated or the outcomes they experi- 
ence. Consensus has emerged among sci- 
entists, ethicists, and policy-makers that 
minimizing bias is a shared responsibility 
among all involved in AI development. For 
example, ensuring equity by eliminating 
bias in AI is a core principle of the World 
Health Organization for governing AI (1). 
But ensuring equity will require more than 
unbiased data and algorithms. It will also 
require reducing biases in how clinicians 
and patients use Al-based algorithms—a 
potentially more challenging task than re- 
ducing biases in algorithms themselves. 
Examination of bias in AI has tended to- + 
ward removing bias from datasets, analy- 
ses, or AI development teams. For example, 
because of unequal recruitment and en- 
rollment, oncology datasets demonstrate 
racial, ethnic, and global geographic biases 
(2). In another example, developers as- 
sumed that health care costs were a proxy 
for health care needs, but then learned 
that Black Americans often receive less 
medical care even when they have greater 
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need (3); the resulting algorithm would 
have cemented this structural racism into 
place. Moreover, encouraging diversity and 
inclusivity in researchers is another way to 
manage AI bias (4). 

Alone these efforts are insufficient. 
Mitigating bias when implementing AI 
suffers from a more challenging problem: 
emergent biases, i.e., biases that emerge 
not merely because of biased datasets or 
algorithms, but because of factors involved 
in real-world implementation (5). The ben- 
efits or harms of an algorithm, even one 
agreed upon as “fair,” are likely to be ex- 
perienced unequally by patients. These 
harms can be seen as latent biases (i.e., 
biases waiting to happen) and the factors 
involved as latent conditions [i.e., charac- 
teristics of the environment that contrib- 
ute to bias (6)]. Clinician-, patient-, and 


to follow AI recommendations. Still, early 
adopters of technology will be more likely 
to incorporate Al-based recommendations 
compared to late adopters, meaning that 
patients cared for by early adopters will 
be more likely to experience benefits or 
harms of AI sooner. Clinicians familiar with 
AI may be more (or less) likely to incorpo- 
rate AI. Age, experience, and other factors 
will thus influence whether or not a clini- 
cian follows an AI-based recommendation, 
though perhaps not in a consistent pattern. 

Moreover, that a model is “AI” could in- 
fluence clinicians. Traditional CDSSs are 
based on expert knowledge and guidelines, 
whereas AlI-based models rely primarily 
on statistical associations. Statistical as- 
sociations can be built upon variables or 
their interactions that clinicians might not 
know to include in their decision-making. 


How a “fair” algorithm can result in biased outcomes 

Aclinician and patient interact with artificial intelligence (Al)—-based decision support that provides information 
about, for example, the likelihood of a diagnosis, utility of a treatment, or a prognosis. Even if the algorithm 

is unbiased, clinician-, patient-, and social-level factors can influence how the recommendations are interpreted 
and implemented, which can result in latent treatment bias. 


“Fair” algorithm 


Clinician factors 
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Familiarity with Al 

Subconscious bias 


social-level factors can interact to create 
biases (see the figure). 

Clinician-level factors will play complex 
roles in determining whether and how cli- 
nicians choose to use Al-derived informa- 
tion. Most physicians do not blindly adopt 
and follow clinical decision support sys- 
tems (CDSSs) or recommendations based 
on algorithms. However, the impacts of 
various factors are not always obvious. 

Although not specific to AI, in some prior 
studies, older physicians are less likely to 
follow algorithm-based recommendations, 
whereas in other studies, younger physi- 
cians override decision support more (7). 
Thus, clinical experience or perceived ex- 
pertise alone may not predict willingness 
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AI algorithms (e.g., deep neural networks) 
often lack transparency or interpretability 
regarding how data are transformed into 
model outputs, sometimes referred to as 
“black box” algorithms. Because medi- 
cine places value on expert and experien- 
tial knowledge, statistical or “black box” 
knowledge could be discounted in deci- 
sions. How clinicians perceive AlI-based 
tools—in terms of how a recommenda- 
tion is presented, transparency surround- 
ing the motivation behind the AI and its 
data sources, and its explainability—could 
influence clinicians’ willingness to use AI. 
As a result, differential use and uptake of 
AI should be expected among clinicians. 
This means that patients under individual 


clinicians’ care will experience the benefits 
and risks of AI in systematically different 
ways—an issue of potential bias. 

Complicating matters further, patient- 
level factors will affect how patients ex- 
perience AI. For one, because of socioeco- 
nomic status, race, ethnicity, geography, 
and other factors, inequities in access to 
clinicians, hospitals, and other services 
are a problem for all health systems world- 
wide. This includes unequal access to 
Al-based technology. Additionally, some 
patients may trust technology more than 
others, and some may have less technology 
literacy. Patients who distrust a clinician 
or health system may be less likely to trust 
that clinician’s or health system’s use of AI, 
and patient distrust is not randomly dis- 
tributed: It associates with race, ethnicity, 
education, and other factors. If given the 
choice, such patients may simply opt out 
of AI in their care (8). 

The severity of patients’ illnesses or 
symptom type may affect clinicians’ will- 
ingness to follow computer-based recom- 
mendations. For example, for a patient in 
the United States who identifies as Black 
and presents with a symptom of pain, data 
suggest that clinicians may be more likely 
to override decision support based on a pa- 
tient’s symptom [e.g., certain types of pain 
(7)] and also to interpret reports of pain dif- 
ferently based on race (9). Therefore, differ- 
ential following of Al-recommended treat- 
ment plans according to patient race could 
differentially harm or help Black patients. 

A third source of factors that contribute 
to emergent biases comes from the social 
and policy context. Health care financing 
mechanisms vary widely, and the affordabil- 
ity and accessibility of AI and what it rec- 
ommends will influence its effects. Because 
medical liability is defined on the basis of 
current standards of care that do not yet in- 
volve AI, the safest way for clinicians to use 
AI will be as confirmation for the existing 
standard of care (10). Early uses of AI may 
thus suffer from status quo bias, hindering 
the potential of AI to result in innovation or 
changes to care plans for patients. 

Clinicians may either rely more on the 
technology or ignore it completely in time- 
pressured clinical environments. This auto- 
mation bias (over- or under-reliance on the 
AI) may exacerbate disparities for under- 
resourced settings where time and financial 
pressures are more intense. In some settings, 
clinicians may be more willing to trust and 
use AI that is developed or endorsed by their 
own hospital or health system, compared to 
commercially developed AI. 

As latent biases emerge from clinical im- 
plementation, they will feed back into learn- 
ing models, potentially negatively affecting 
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the performance of the model itself (JJ). 
Increasing clinician awareness of AI’s biases 
is critical, but this desire may be paradoxical: 
Knowing about biases in AI may result in 
less willingness to use AlI-based recommen- 
dations for patients that a clinician judges 
“different” from others. Assuming models 
are biased in terms of race or ethnicity, for 
example, could result in clinicians systemati- 
cally overriding a model’s recommendation 
for that group of patients. 

Several strategies exist to identify and 
address latent biases. One strategy could 
involve providing clinicians with model- 
specific, individual-level performance 
feedback regarding whether they tend to 
outperform or underperform it, or if they 
are systematically following or overriding 
a model only for certain patient groups. 
Individualized feedback has the potential 
to improve clinician performance (12). 
However, a challenge for assessing bias is 
that clinicians may not see sufficient num- 
bers of patients in different groups to al- 
low rigorous, stratified comparisons. 

Patients should be informed about the 
use of AI in their clinical care as a matter 
of respect. This includes general messag- 
ing about the use of predictive algorithms, 
chatbots, and other Al-based technologies, 
and specific notification when new AI- 
based technologies are used in their indi- 
vidual care. Doing so may improve aware- 
ness of AI, motivate conversations with 
clinicians, and support greater transpar- 
ency around AI use. 

Exactly how much to disclose, and in what 
format, are unanswered questions that re- 
quire additional research. There is a need to 
avoid AI exceptionalism—the idea that AI is 
riskier or requires greater protection, just be- 
cause it is AI—and presently patients want to 
know more, not less (8). That other decisions 
relying on algorithms, such as clinical risk 
calculators or computer-aided radiographic 
or electrocardiogram interpretation, may 
not be routinely shared with patients is not 
an argument in favor of secrecy. 

Bias has not been a major aspect of drug 
and device regulations, which focus on 
overall safety and efficacy. Recent US pro- 
posals could extend legal liability to phy- 
sicians and hospitals, meaning they could 
be required to provide compensatory dam- 
ages to patients or be subject to penalties 
for use of biased clinical algorithms; these 
could be applied to AI algorithms (/3). 
However, the complexity of AI algorithms 
and persistent ethical disagreement over 
when differential performance by race or 
ethnicity equals true bias complicate liabil- 
ity proposals. Drug and device regulatory 
agencies might consider making evalua- 
tions of bias mandatory for approval (74). 
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A first step could be requiring evaluations 
of differential performance and bias under 
different real-world assumptions in ap- 
proval processes and other forums, such as 
in journal reporting of AI research. 

In addition, the gaze of AI should be 
turned on itself. This requires proactive, 
intentional development of AI tools to iden- 
tify biases in AI and in its clinical imple- 
mentation (J5). AI may contribute to the 
emergence of biases, but it also has the po- 
tential to detect biases and hence facilitate 
new ways of overcoming them. Open-source 
tools, such as AI Fairness 360, FairML, and 
others, show promise in helping research- 
ers assess fairness in their machine learning 
data and algorithms. These tools can assess 
biases in datasets, predictive outputs, and 
even the different techniques that can be 
used to mitigate bias according to differ- 
ent metrics of fairness. Their application to 
health care data and algorithms deserves 
rigorous scientific examination. 

Implementation research is urgently 
needed to better understand the role of dif- 
ferent contextual factors and latent condi- 
tions in allowing biases to emerge. Exactly 
which patients may experience bias under 
which circumstances requires ongoing rig- 
orous study. In AI, biased data and biased 
algorithms result in biased outcomes for 
patients, but so do unbiased data and al- 
gorithms when they enter a biased world. 
All patients deserve to benefit from both 
fair algorithms and fair implementation. 
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he past few years have seen a surge 

of interest in using machine learning 

(ML) methods for studying the be- 

havior of nonhuman animals (here- 

after “animals”) (J). A topic that has 

attracted particular attention is the 
decoding of animal communication systems 
using deep learning and other approaches 
(2). Now is the time to tackle challenges con- 
cerning data availability, model validation, 
and research ethics, and to embrace opportu- 
nities for building collaborations across disci- 
plines and initiatives. 

Researchers must infer the meaning, or 
function, of animal signals through observa- 
tion and experimentation (3). This is a chal- 
lenging task, not least because animals use 
a wide range of communication modalities, 
including visual, acoustic, tactile, chemical, 
and electrical signals—often in conjunction, 
and beyond humans’ perceptive capabilities. 
Observational work focuses on recording the 
signals of interest as well as detailed contex- 
tual information, including the identity, state, 
and behavior of both the senders and receiv- 
ers of signals, their relationships and past in- 
teractions, and relevant environmental con- 
ditions. Some signal types may be produced 
only under certain circumstances, eliciting 
a specific behavioral response; a classic ex- 
ample is a vervet monkey (Chlorocebus py- 
gerythrus) giving an alarm call when it spots 
a predator, which causes group members to 
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the performance of the model itself (JJ). 
Increasing clinician awareness of AI’s biases 
is critical, but this desire may be paradoxical: 
Knowing about biases in AI may result in 
less willingness to use AlI-based recommen- 
dations for patients that a clinician judges 
“different” from others. Assuming models 
are biased in terms of race or ethnicity, for 
example, could result in clinicians systemati- 
cally overriding a model’s recommendation 
for that group of patients. 

Several strategies exist to identify and 
address latent biases. One strategy could 
involve providing clinicians with model- 
specific, individual-level performance 
feedback regarding whether they tend to 
outperform or underperform it, or if they 
are systematically following or overriding 
a model only for certain patient groups. 
Individualized feedback has the potential 
to improve clinician performance (12). 
However, a challenge for assessing bias is 
that clinicians may not see sufficient num- 
bers of patients in different groups to al- 
low rigorous, stratified comparisons. 

Patients should be informed about the 
use of AI in their clinical care as a matter 
of respect. This includes general messag- 
ing about the use of predictive algorithms, 
chatbots, and other Al-based technologies, 
and specific notification when new AI- 
based technologies are used in their indi- 
vidual care. Doing so may improve aware- 
ness of AI, motivate conversations with 
clinicians, and support greater transpar- 
ency around AI use. 

Exactly how much to disclose, and in what 
format, are unanswered questions that re- 
quire additional research. There is a need to 
avoid AI exceptionalism—the idea that AI is 
riskier or requires greater protection, just be- 
cause it is AI—and presently patients want to 
know more, not less (8). That other decisions 
relying on algorithms, such as clinical risk 
calculators or computer-aided radiographic 
or electrocardiogram interpretation, may 
not be routinely shared with patients is not 
an argument in favor of secrecy. 

Bias has not been a major aspect of drug 
and device regulations, which focus on 
overall safety and efficacy. Recent US pro- 
posals could extend legal liability to phy- 
sicians and hospitals, meaning they could 
be required to provide compensatory dam- 
ages to patients or be subject to penalties 
for use of biased clinical algorithms; these 
could be applied to AI algorithms (/3). 
However, the complexity of AI algorithms 
and persistent ethical disagreement over 
when differential performance by race or 
ethnicity equals true bias complicate liabil- 
ity proposals. Drug and device regulatory 
agencies might consider making evalua- 
tions of bias mandatory for approval (74). 
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A first step could be requiring evaluations 
of differential performance and bias under 
different real-world assumptions in ap- 
proval processes and other forums, such as 
in journal reporting of AI research. 

In addition, the gaze of AI should be 
turned on itself. This requires proactive, 
intentional development of AI tools to iden- 
tify biases in AI and in its clinical imple- 
mentation (J5). AI may contribute to the 
emergence of biases, but it also has the po- 
tential to detect biases and hence facilitate 
new ways of overcoming them. Open-source 
tools, such as AI Fairness 360, FairML, and 
others, show promise in helping research- 
ers assess fairness in their machine learning 
data and algorithms. These tools can assess 
biases in datasets, predictive outputs, and 
even the different techniques that can be 
used to mitigate bias according to differ- 
ent metrics of fairness. Their application to 
health care data and algorithms deserves 
rigorous scientific examination. 

Implementation research is urgently 
needed to better understand the role of dif- 
ferent contextual factors and latent condi- 
tions in allowing biases to emerge. Exactly 
which patients may experience bias under 
which circumstances requires ongoing rig- 
orous study. In AI, biased data and biased 
algorithms result in biased outcomes for 
patients, but so do unbiased data and al- 
gorithms when they enter a biased world. 
All patients deserve to benefit from both 
fair algorithms and fair implementation. 
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he past few years have seen a surge 

of interest in using machine learning 

(ML) methods for studying the be- 

havior of nonhuman animals (here- 

after “animals”) (J). A topic that has 

attracted particular attention is the 
decoding of animal communication systems 
using deep learning and other approaches 
(2). Now is the time to tackle challenges con- 
cerning data availability, model validation, 
and research ethics, and to embrace opportu- 
nities for building collaborations across disci- 
plines and initiatives. 

Researchers must infer the meaning, or 
function, of animal signals through observa- 
tion and experimentation (3). This is a chal- 
lenging task, not least because animals use 
a wide range of communication modalities, 
including visual, acoustic, tactile, chemical, 
and electrical signals—often in conjunction, 
and beyond humans’ perceptive capabilities. 
Observational work focuses on recording the 
signals of interest as well as detailed contex- 
tual information, including the identity, state, 
and behavior of both the senders and receiv- 
ers of signals, their relationships and past in- 
teractions, and relevant environmental con- 
ditions. Some signal types may be produced 
only under certain circumstances, eliciting 
a specific behavioral response; a classic ex- 
ample is a vervet monkey (Chlorocebus py- 


gerythrus) giving an alarm call when it spots 


a predator, which causes group members to 
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seek shelter. Establishing such correlations 
enables the formulation of hypotheses about 
signal function that can then be tested exper- 
imentally (e.g., with controlled playbacks). 

Following this approach, decades of care- 
ful research have produced major advances 
in understanding animal communication (3). 
But there are considerable challenges, such 
as avoiding anthropocentric biases in data 
collection and interpretation, processing 
ever-increasing volumes of data, charting the 
full complexity of animals’ signaling behav- 
ior, and achieving comprehensive functional 
decoding. ML offers some potential solutions. 

Animal signals can be investigated us- 
ing a rich toolkit of increasingly powerful 
ML methods, which vary in their modeling 
objectives, data requirements, and reliance 
on expert annotation. This includes, among 
other approaches, supervised learning (e.g., 
for determining which features accurately 
predict human-labeled signal types) and un- 
supervised and self-supervised learning (e.g., 
for discovering signal repertoires of an indi- 
vidual, group, or population). 

Self-supervised deep-learning methods (4) 
are of interest because they require neither 
annotated datasets nor predefining features 
that are potentially relevant for communica- 
tion. They are also the basis of “foundation 
models,” which are capable of remarkable 
generalizations across tasks (5). For exam- 
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ple, large language models that have been 
trained to predict the next word from a given 
sequence of words can subsequently be used 
to carry out much more complex tasks, such 
as inferring the syntactic classes of, and rela- 
tions between, linguistic units, or generating 
realistic text (5). 

Methods that can integrate different data 
modalities seem particularly promising for 
facilitating functional decoding because they 
can provide a fuller account of communi- 
cation events. ML models have been devel- 
oped that efficiently learn to link images to 
words, words to speech, and content across 
other modality combinations (5), and this ap- 
proach could be applied productively to ani- 
mal study systems, for example, by correlat- 
ing vocalizations with specific behaviors. ML 
would effectively assist with the challenging 
task of detecting cross-modal associations 
(and structure) that can, in turn, inform the 
design of validation experiments to establish 
causality (see the figure). 

Because many ML methods were originally 
developed for natural language processing, 
exciting avenues have started opening up for 
exploring the much-debated potential simi- 
larities between human language and animal 
communication systems (6). Observations 
and experimental work suggest that at least 
some animals, such as southern pied bab- 
blers (Turdoides bicolor), exhibit some of the 


order sensitivity and compositionality that 
are characteristic of human language (7). ML 
approaches could leverage large datasets to 
search for subtlety and complexity that elude 
traditional methods, potentially expanding 
the known set of communication features 
shared across divergent taxa. 

There is a growing number of studies that 
are exploiting the potential of ML for inves- 
tigating animal communication, including 
large collaborative initiatives, such as the 
Earth Species Project (ESP); Communication 
and Coordination Across Scales (CCAS); 
Vocal Interactivity in-and-between Humans, 
Animals and Robots (VIHAR); Interspecies 
Internet; and Project CETI (Cetacean 
Translation Initiative), which recently pro- 
vided a detailed roadmap for ML-assisted 
work on sperm whale (Physeter macroceph- 
alus) communication (2). Although efforts 
to tackle this grand research challenge are 
clearly intensifying, the field faces at least 
two main data-related obstacles: Most 
methods require vast amounts of data (4), 
and recordings of a single modality (e.g., 
vocalizations) are insufficient for functional 
decoding; additional context is required, in- 
cluding information on the animals’ behav- 
ior and environment. 

Large volumes of audio and video data are 
held in community-sourced archives (such as 
the Macaulay Library or xeno-canto), are be- 
ing accumulated by passive recording arrays, 
or can be scraped from the internet. Mining 
these data sources will provide fascinating 
glimpses of the richness of animal communi- 
cation, but on its own, such work is unlikely 
to achieve breakthroughs in decoding signal 
function. This is chiefly because robust in- 
formation on the identities and states of the 
senders and receivers, and the specific com- 
munication context, is usually lacking. 

High-quality datasets are available for 
some taxa, enabling swift progress with 
core model-development objectives. But it is 
clear that community mobilization and ap- 
propriate resourcing are required to ensure 
that species experts are fully involved in the 
annotation and interpretation of existing 
recordings and can lead targeted efforts to 
collect new data at scale, in both the labora- 
tory and field. For wild animals, a range of 
methods can be used to collect suitable data- 
sets, including observation of focal subjects, 
autonomous cameras and audio recorders, 
drones and robots, and animal wearables 
(bio-loggers). Some bio-logging devices can 
collect audio and body-motion data simul- 
taneously for the same individual, providing 
valuable input for multimodal ML models 
(see the figure). 

The journey could be the biggest reward. 
Training the lens of ML on a broad range of 
taxa will likely uncover surprising degrees 
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A MACHINE-INTELLIGENT WORLD 


Using multimodal data and experiments to understand animal signals 
Machine-learning (ML) methods can be used to integrate information on sender, receiver, and communication context, revealing patterns that may inform hypotheses 
about signal function and, in turn, the design of controlled experiments. ML-assisted research on animal communication will likely generate important benefits, such as 
improving animal conservation and welfare, but is not without its challenges; addressing ethical concerns is a top priority. 
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of previously hidden complexity in animals’ 
communicative behavior. Many of the species 
that appear to use only a handful of basic 
call types may turn out to possess rich vo- 
cal repertoires, and those that are renowned 
for their sophisticated communication may 
be shown to be more impressive still. There 
are early indications of the discovery poten- 
tial of ML, as highlighted by a recent study 
that explored individual and group differ- 
ences in the vocal behavior of zebra finches 
(Taeniopygia guttata) (8). 

The ability of ML to produce systematic 
inventories of vocal (or other signaling) out- 
put across a diverse range of taxa will enable 
unprecedented comparative analyses, help- 
ing researchers to pinpoint the evolutionary 
drivers, genomic signatures, life-history cor- 
relates, and cognitive and sensory founda- 
tions of different communication systems. 
At the same time, longitudinal recordings for 
individual subjects could reveal how commu- 
nication skills arise and mature (9). 

But perhaps most importantly, advances 
in this field could boost animal conserva- 
tion and welfare. For example, in critically 
endangered species, such as the Hawaiian 
crow (Corvus hawaiiensis), comparisons 
with historical baseline data could generate 
a detailed record of how population bottle- 
necks have altered vocal repertoires, po- 
tentially leading to impoverished commu- 
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nicative capabilities (JO); lost calls of high 
fitness relevance, such as those involved 
in foraging, courtship, or antipredator be- 
havior, could then conceivably be reintro- 
duced. Furthermore, there is increasing 
recognition that socially transmitted infor- 
mation may affect population viability (77), 
as illustrated by foraging specializations 
in killer whales (Orcinus orca) (12). Where 
vocal dialects can be established as “cul- 
tural markers,’ ML approaches would en- 
able automated mapping of social popula- 
tion structure and identification of animal 
groups at risk of losing critical knowledge. 

ML could also be used to identify animal 
signals that are associated with stress, dis- 
comfort, pain, and evasion, or with positive 
states, such as arousal and playfulness. This 
could provide momentum for improving 
the living conditions of livestock and other 
captive animals and may even enable the 
assaying of wild populations to measure the 
impact of anthropogenic stressors. Ecological 
“soundscape” analyses are, at present, largely 
focused on species detection, but it should be 
possible to listen in on animals’ welfare at the 
landscape level (73). This idea could be de- 
veloped further by looking beyond communi- 
cation, for example, by developing ML tools 
that can examine satellite-recorded animal 
movement tracks for signatures of disease, 
distress, or human avoidance. 


Despite manifold potential benefits, ML- 
assisted research on animal communica- 
tion raises major ethical questions, such as 
under what circumstances it is acceptable 
to conduct playback experiments with wild 
animals. Advanced chatbots may enable re- 
searchers to initiate communication with 
animals before signal function is fully under- 
stood, potentially causing unintended harm. 
For example, broadcasting vocalizations to 
wild humpback whales (Megaptera novae- 
angliae) could inadvertently trigger changes 
in singing behavior on an ocean-basin scale. 
These issues must be tackled head-on and 
not as an afterthought. Cross-stakeholder 
consultation is urgently needed to develop 
best-practice guidelines and appropriate leg- 
islative frameworks (J4). 

Other challenges and opportunities lie 
ahead. For example, it is important to coordi- 
nate research efforts across existing initiatives 
and to enhance engagement of experts on 
animal communication, tracking, conserva- 
tion, and welfare. Despite rapid technological 
advances, progress in this field will continue 
to depend on careful consideration of each 
study species’ biology, a detailed knowledge 
of communication contexts, and controlled 
behavioral experiments (3). This expertise 
is essential for informing and validating ML 
analyses and intensifying data interpretation 
and collection efforts. Professional societies 
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and networks could help coordinate inclusive 
community-driven collaborations. 

Workflows should be developed using 
study systems where data collection and ex- 
perimental validation are relatively straight- 
forward. In captive settings, researchers can 
ensure excellent experimental control as 
well as the highest ethical and welfare stand- 
ards; good models include rodents, bats, 
and birds. Such work can be complemented 
with analyses of extensive field datasets 
that are already available for some species. 
Once methods have been established, they 
can be cautiously applied to the considera- 
bly more challenging problem of studying 
difficult-to-observe wild animals. 

Present developments in ML are exception- 
ally fast paced. Beyond the use of deep-learn- 
ing methods, there is scope for trialing other 
ML frameworks, such as reinforcement learn- 
ing and meta-learning (i.e., learning from the 
output of other ML models). As models are 
developed, formal “benchmarking” will be 
key to improving the reliability and efficiency 
of analysis pipelines (75), although safeguards 
must be put in place to prevent the misuse of 
open resources, such as attempts to disturb, 
kill, or weaponize animals. 

ML holds the potential to generate trans- 
formative advances in our understanding 
of animal communication systems, uncov- 
ering unimagined degrees of richness and 
sophistication. But it is essential that future 
advances are used to benefit the animals be- 
ing studied. 
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Do we want less automation? 


AI may provide a path to decrease inequality 


By Ajay Agrawal, Joshua S. Gans, 
Avi Goldfarb 


mpressive achievements made through 

artificial intelligence (AI) innovations 

in automating the tasks required in 

many jobs have reinforced concerns 

about labor market disruption and in- 

creased income inequality. This has mo- 
tivated calls for change in the direction of 
AI innovation from being guided by task 
automation to instead focusing on labor 
augmentation (7). But task automation and 
labor augmentation are not polar oppo- 
sites. Instead, automation of some tasks can 
lead to augmentation of labor elsewhere. 
Furthermore, AI automation may provide 
a path to reversing the trend of increas- 
ing income inequality by enabling dispro- 
portionate productivity improvements for 
lower-wage workers, allowing them to per- 
form at levels that would previously require 
years of education and experience. 

People have worried about automation— 
using machines to do the work that humans 
do—for centuries. Over that time, automa- 
tion has proceeded apace, with entire sec- 
tors such as agriculture and manufacturing 
going from majority to minority shares of 
employment in many countries. Despite 
these sweeping changes, continual pro- 
ductivity improvements have not brought 
about technological unemployment. There 
has always been more for people to do, 
even if the fruits of economic growth have 
in recent times not lifted the incomes of all 
(2). Nonetheless, with the accelerated pace 
of innovations in AI technologies that spe- 
cifically target the automation of cognitive 
rather than physical labor, many econo- 
mists have become concerned that this time 
will be different, leading to substantial dis- 
ruption and increased inequality but with 
little benefit to productivity and standard of 
living (3, 4). 

The economic history of the internet 
and computing over the past 50 years sug- 
gests that worries about inequality are not 
unfounded. These were what economists 
call skill-biased technologies (5), placing 
increasing demands on the skill, educa- 
tion, and know-how of the workforce. As 
computers and the internet diffused in the 
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decades after 1980, demand for skills grew 
faster than supply, and inequality increased 
because of the disproportionate increase in 
wages for high-skilled workers. 

More recently, consider all of the new 
products based on generative AI that auto- 
mate the tasks of reading, writing, editing, 
summarizing, composing music, creating 
images, synthesizing speech, translating 
languages, programming computers, and 
producing videos. In each case, they prom- 
ise time-saving productivity boosts by sub- 
stituting capital (hardware or software) 
for the labor time that would otherwise be 
devoted to such tasks. If a machine can do 
these tasks, what will become of the people 
who did them previously? What if, instead 
of seeking to automate existing tasks, the 
mindset of the innovators was to provide 
tools that make existing workers more pro- 
ductive in their current jobs? 


AUTOMATION AND AUGMENTATION 
Automation and augmentation need not be 
opposites; the economic definitions of each 
are not presented as such. Economists study- 
ing automation consider a job to be a col- 
lection of tasks [as developed in (4, 6) and 
applied in (7-9)]. Automation occurs when 
a machine (capital) is substituted for labor 
performance of one or more tasks in a work- 
flow. This will typically increase productiv- 
ity, which determines long-term economic 
growth and the average standard of living. 
With technological innovation having long 
been recognized as a key source of produc- 
tivity growth, there is close to a consensus 
among leading economists that AI-driven 
technological change is likely to increase 
productivity, economic growth, and the av- 
erage standard of living (J0). However, if 
workers are replaced by machines, then this 
improved productivity can come at a sub- 
stantial cost to displaced workers. 

If technology creates new tasks, then 
displaced workers can find new things to 
do (4). Augmentation can therefore be seen 
as the development of new tasks (3), par- 
ticularly if those tasks complement exist- 
ing human labor (1). There is a challenge, 
however, in identifying such tasks and de- 
termining which inventions are likely to 
lead to their development. The difficulty is 
that one person’s automation is another’s 
augmentation. Automating one task may 
create even more new tasks. 
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and networks could help coordinate inclusive 
community-driven collaborations. 

Workflows should be developed using 
study systems where data collection and ex- 
perimental validation are relatively straight- 
forward. In captive settings, researchers can 
ensure excellent experimental control as 
well as the highest ethical and welfare stand- 
ards; good models include rodents, bats, 
and birds. Such work can be complemented 
with analyses of extensive field datasets 
that are already available for some species. 
Once methods have been established, they 
can be cautiously applied to the considera- 
bly more challenging problem of studying 
difficult-to-observe wild animals. 

Present developments in ML are exception- 
ally fast paced. Beyond the use of deep-learn- 
ing methods, there is scope for trialing other 
ML frameworks, such as reinforcement learn- 
ing and meta-learning (i.e., learning from the 
output of other ML models). As models are 
developed, formal “benchmarking” will be 
key to improving the reliability and efficiency 
of analysis pipelines (75), although safeguards 
must be put in place to prevent the misuse of 
open resources, such as attempts to disturb, 
kill, or weaponize animals. 

ML holds the potential to generate trans- 
formative advances in our understanding 
of animal communication systems, uncov- 
ering unimagined degrees of richness and 
sophistication. But it is essential that future 
advances are used to benefit the animals be- 
ing studied. 
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mpressive achievements made through 

artificial intelligence (AI) innovations 

in automating the tasks required in 

many jobs have reinforced concerns 

about labor market disruption and in- 

creased income inequality. This has mo- 
tivated calls for change in the direction of 
AI innovation from being guided by task 
automation to instead focusing on labor 
augmentation (7). But task automation and 
labor augmentation are not polar oppo- 
sites. Instead, automation of some tasks can 
lead to augmentation of labor elsewhere. 
Furthermore, AI automation may provide 
a path to reversing the trend of increas- 
ing income inequality by enabling dispro- 
portionate productivity improvements for 
lower-wage workers, allowing them to per- 
form at levels that would previously require 
years of education and experience. 

People have worried about automation— 
using machines to do the work that humans 
do—for centuries. Over that time, automa- 
tion has proceeded apace, with entire sec- 
tors such as agriculture and manufacturing 
going from majority to minority shares of 
employment in many countries. Despite 
these sweeping changes, continual pro- 
ductivity improvements have not brought 
about technological unemployment. There 
has always been more for people to do, 
even if the fruits of economic growth have 
in recent times not lifted the incomes of all 
(2). Nonetheless, with the accelerated pace 
of innovations in AI technologies that spe- 
cifically target the automation of cognitive 
rather than physical labor, many econo- 
mists have become concerned that this time 
will be different, leading to substantial dis- 
ruption and increased inequality but with 
little benefit to productivity and standard of 
living (3, 4). 

The economic history of the internet 
and computing over the past 50 years sug- 
gests that worries about inequality are not 
unfounded. These were what economists 
call skill-biased technologies (5), placing 
increasing demands on the skill, educa- 
tion, and know-how of the workforce. As 
computers and the internet diffused in the 
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decades after 1980, demand for skills grew 
faster than supply, and inequality increased 
because of the disproportionate increase in 
wages for high-skilled workers. 

More recently, consider all of the new 
products based on generative AI that auto- 
mate the tasks of reading, writing, editing, 
summarizing, composing music, creating 
images, synthesizing speech, translating 
languages, programming computers, and 
producing videos. In each case, they prom- 
ise time-saving productivity boosts by sub- 
stituting capital (hardware or software) 
for the labor time that would otherwise be 
devoted to such tasks. If a machine can do 
these tasks, what will become of the people 
who did them previously? What if, instead 
of seeking to automate existing tasks, the 
mindset of the innovators was to provide 
tools that make existing workers more pro- 
ductive in their current jobs? 


AUTOMATION AND AUGMENTATION 
Automation and augmentation need not be 
opposites; the economic definitions of each 
are not presented as such. Economists study- 
ing automation consider a job to be a col- 
lection of tasks [as developed in (4, 6) and 
applied in (7-9)]. Automation occurs when 
a machine (capital) is substituted for labor 
performance of one or more tasks in a work- 
flow. This will typically increase productiv- 
ity, which determines long-term economic 
growth and the average standard of living. 
With technological innovation having long 
been recognized as a key source of produc- 
tivity growth, there is close to a consensus 
among leading economists that AI-driven 
technological change is likely to increase 
productivity, economic growth, and the av- 
erage standard of living (J0). However, if 
workers are replaced by machines, then this 
improved productivity can come at a sub- 
stantial cost to displaced workers. 

If technology creates new tasks, then 
displaced workers can find new things to 
do (4). Augmentation can therefore be seen 
as the development of new tasks (3), par- 
ticularly if those tasks complement exist- 
ing human labor (1). There is a challenge, 
however, in identifying such tasks and de- 
termining which inventions are likely to 
lead to their development. The difficulty is 
that one person’s automation is another’s 
augmentation. Automating one task may 
create even more new tasks. 
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SPECIAL SECTION 


A MACHINE-INTELLIGENT WORLD 


The use of a machine in a task is labor 
augmenting if the increase in productiv- 
ity in that task raises labor productivity in 
other tasks. This will happen if tasks are 
complements. It may also arise if the time 
saved in performing the now automated 
task allows allocation of time to other tasks 
that involve higher productivity boosts. 
The impact on labor demand, in the end, 
depends on how much more productive the 
automated process is. Automation may be 
the correct decision for managers, yet in 
aggregate, the productivity benefit may be 
small (4). Several economists have argued 
that the current tax system and corporate 
environment encourage such incrementally 
useful automation (“so-so technologies”) 
that substitutes for, rather than comple- 
ments, labor (3, 4). 

This distinction between augmentation 
and automation is illustrated by a recent 
study of the impact of AI on Tokyo taxi driv- 
ers (11). The task in question was to predict 
what locations would have greater con- 
sumer demand for rides at any given time. 
A product, AI Navi, was designed with the 
goal of automating the task of predicting 
passenger demand. This improved the pro- 
ductivity of drivers, who spent up to 7% less 
time driving without passengers. Thus, one 
task—predicting passenger demand—was 
automated, which augmented the produc- 
tivity of another task—driving—and with it, 
the taxi drivers’ entire job. 

AI Navi was a direct substitute for worker 
effort and skill. For experienced drivers, this 
substitution did not increase the productiv- 
ity of that task, and hence, job performance 
was essentially unchanged. For less-skilled 
workers, however, their overall job perfor- 
mance improved by 14%. Similar results 
have been found in other contexts. For ex- 
ample, the use of a generative AI assistant 
in a call center substantially improved the 
productivity of novice and low-skilled work- 
ers but had little impact on the most pro- 
ductive workers (12). 

This type of impact stands in contrast to 
the “skill-biased technical change’—when 
those with higher wages had their work 
augmented, and others had their work au- 
tomated—that economists believe explains 
much of the increases in inequality over the 
past few decades. For example, increased 
adoption of computing was shown to drive 
expansion of high-wage and low-wage work 
at the expense of middle-wage jobs in the 
1980s (13). This change was likely a result of 
computing complementing abstract high- 
education tasks and substituting routine 
middle-education tasks. 

Often, the computer did not automate 
the core tasks of particular workers but aug- 
mented those with specific skills. Consider 
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word processing. Before word processing 
on personal computers, typing was a sepa- 
rate process that involved a specialized skill 
that was central to millions of jobs. A typist 
did several related tasks involving typing, 
error correction, and page setting. Word 
processing on personal computers did not 
automate typing, but it did automate error 
correction (the entire document no longer 
needed to be retyped). This made it efficient 
for knowledge workers to type themselves 
because automating the time-consuming 
task of correcting errors meant that typing 
errors were no longer consequential. This 
augmented what a knowledge worker could 
do. In the process, it reduced demand for 
secretarial workers who received midlevel 
wages and increased demand for knowl- 
edge workers who already were earning 
high wages. To economists, automated error 
correction may have seemed like augmenta- 
tion because it is not listed as a specific task 
in the databases they use to understand 
workflows [see supplementary materials 


(SM)]. Instead, for typists, secretaries, or 
administrative assistants, automated error 
correction is an implied part of other tasks 
such as checking for grammar and format- 
ting errors, which did not get automated. 
More generally, computers replaced work- 
ers engaged in routine technical duties, such 
as bookkeeping, administrative tasks, and 
manufacturing processes. Simultaneously, 
they enhanced the abilities of educated pro- 
fessionals who excel in areas such as prob- 
lem-solving, creativity, and persuasion (6). 


HOW Al MIGHT REDUCE INEQUALITY 

When we look further into AI success sto- 
ries of automation, it is often the case that 
tasks are automated in a way that is not 
skill biased, and indeed is the opposite: al- 
lowing more people to perform entire jobs 
at a level that previously required expensive 
skills. Hence, they present an opportunity 
to reduce the premium paid to workers with 
those skills and distribute employment op- 
portunities more widely. 
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Diagnosis 

A burgeoning body of research has dem- 
onstrated the potential of AI in delivering 
medical diagnoses (J4). At the core of this 
research lies the understanding that diag- 
nosis is fundamentally a predictive process, 
taking known symptoms and determining 
their underlying causes. Traditionally, diag- 
nostic expertise has been the cornerstone of 
a physician’s skillset, with medical schools 
and selection processes emphasizing accu- 
rate diagnosis. 

Whereas other health care professionals 
may excel in patient support, stress man- 
agement, or day-to-day care, the ability 
to diagnose is a key skill that sets doctors 
apart. Consequently, we can consider di- 
agnosis as one task, with other medical re- 
sponsibilities representing other tasks. The 
expertise required for diagnosis is relatively 
rare compared with the skills associated 
with other medical tasks. 

Al-driven diagnostic tools, however, have 
the potential to automate this rare skill. In 
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Al-driven diagnostic tools, with appropriate policy 
support, hold potential to expand opportunities and 
boost wages for nonphysician health care workers. 


2021, there were approximately 760,000 
physician and surgeon positions in the 
United States, with median annual incomes 
exceeding $200,000 (see SM). By automat- 
ing the core diagnostic skill that underpins 
these roles, AI has the potential to substan- 
tially disrupt the value that doctors bring 
to health care, potentially causing employ- 
ment stagnation and wage suppression. 
This concern aligns with cautionary predic- 
tions regarding Al’s replication of human 
intelligence (J, 3). 

By contrast, the wider health care profes- 
sion—which includes 3 million registered 
nurses and millions of other professionals 
such as pharmacists, nurse practitioners, 
physician assistants, and paramedics— 
could potentially benefit from AlI-driven 
diagnostics. Diagnostic capabilities often 
act as a barrier for these professionals to 
fully use their skills (74). By granting access 
to Al-assisted diagnostics, these nonphysi- 
cian health care workers could experience 
an expansion in career opportunities and a 
potential boost in wages. 

In this case, whether AI reduces inequal- 
ity is not driven by a decision of whether to 
automate or augment. Instead, the impact 
depends on whose core skills get automated. 
If AI automates the higher-wage workers 
such as doctors, then it might augment 
other medical professionals and decrease 
inequality. By contrast, if the regulatory en- 
vironment or industry incentive structure 
direct innovation toward automating the 
lower-wage workers—for example, because 
the decisions made by physicians cannot 
be delegated to others by law—then any AI 
automation will focus on augmenting those 
who are already earning high incomes and 
automating others (/4). 


Writing and editing 

Generative AI tools such as ChatGPT sug- 
gest the potential to automate reading, writ- 
ing, analyzing, and editing. Data from labor 
market analytics firm Lightcast show that 
474,873 US job postings in January 2023 re- 
quired writing or editing skills, out of a total 
of 3.4 million job postings (see SM). Thus, 
more than 10% of posted jobs are likely to 
be affected by automated writing tools. Jobs 
that require writing and editing skills have 
a higher posted salary on average than that 
for jobs that do not require such skills. The 
four job categories for which more than 75% 
of postings demand writing and editing 
skills are proofreaders and copy markers, 
technical writers, editors, and writers and 
authors. Without the writing and editing 


tasks that can be done with large language 
models (LLMs) such as ChatGPT, the jobs of 
proofreaders, copy markers, and technical 
writers might be completely replaced. 

But the situation for editors, writers, and 
authors is different. There are a number of 
decisions involved that might not be auto- 
mated with the recent advances in LLMs. 
In the presence of LLMs, the task that we 
classify as writing can be broken down into 
three tasks: prompting, writing, and sign- 
ing off. Only the writing task is automated. 
To determine whether a job that previously 
assigned importance to writing skills would 
be affected positively or negatively, we 
would need to know how precisely those 
skills are now applied. 

To see this, consider journalism, a job cat- 
egory in which 65% of job postings list writ- 
ing or editing skills [see models in (10) and 
(14)]. Print journalism places high weight 
on writing skills. What is less obvious is 
whether journalists engage in prompting or 
signing off as part of their job. There are 
two possibilities. In job one, a journalist 
may be assigned an article to write by an 
editor. They then write the article, and the 
editor decides whether it can be published. 
In job two, a journalist comes to an editor 
with a proposal. They then write the arti- 
cle, and the editor decides whether it can 
be published. The only difference between 
these two jobs is who comes up with the 
idea for the article. 

That distinction matters when we intro- 
duce LLMs that can do the narrowly defined 
writing task. If the editor is the prompter, 
the editor may prompt an LLM rather than 
a human journalist to do the writing. Thus, 
journalists who are not prompters may be 
less valued when LLMs are deployed. By 
contrast, if the journalist is the prompter, 
the LLM becomes a tool for the journalist 
to more efficiently produce the article. The 
LLM increases the value of what the jour- 
nalist is doing in terms of identifying in- 
teresting topics to write about. At present, 
both jobs are classified as journalists, so 
we cannot determine how the entire class 
of workers will be affected. Furthermore, 
LLMs create opportunities for any people 
who have a good sense of which stories are 
valuable and important but who lack writ- 
ing skills. 

In each case, these are examples of de-skill- 
ing automation. Technology automates a task 
that requires highly specialized and highly 
paid workers, enabling less-specialized or 
lower-paid workers to include that task in 
their workflow. The demand for the skills of 
the highly-paid workers declines, whereas the 
demand for less-highly-paid workers is likely 
to increase. This should reduce inequality 
while increasing overall output, albeit reduc- 
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ing the relative (and perhaps absolute) in- 
come of the previously highly-paid workers 
whose skills have been automated. 

It is too early to determine with much cer- 
tainty how this will play out for AI, whether 
the impact on any particular job will be 
positive or negative. Research is beginning 
to emphasize which jobs are most likely to 
be affected rather than lost (8). For exam- 
ple, that classification tasks such as image 
recognition can be done with AI will affect 
workers whose jobs involve classification 
tasks, such as radiologists (7). Recent work 
examining differences between generative 
AI (specifically, LLMs) and nongenerative 
AI [as described in (7)] shows that millions 
of jobs have the potential to be affected by 
LLMs. Notably, these studies emphasize 
that “affect” does not mean “replace.” For 
many jobs, automating some aspects of the 
workflow might increase productivity, the 
wages of workers who have that job, and 
the number of workers hired to do that job. 

Even when some jobs get automated, that 
might complement the tasks done by other 
workers. Many empirical exercises [for ex- 
ample, (7-9)] emphasize the direct impact 
on jobs, but they do not explore the jobs 
that might be enhanced through comple- 
mentary production processes. For exam- 
ple, in January 2023, there were 186,417 job 
postings in the United States that specified 
language skills (such as Spanish Language 
or American Sign Language), or about 5% of 
the total job postings (see SM). Automating 
language translation would directly affect 
many of these jobs. At the same time, many 
other jobs that do not require language 
skills would also be affected. For example, a 
recent study showed that small businesses 
that used a rudimentary automated lan- 
guage translation tool on eBay experienced 
a 17.5% increase in exports to markets where 
that language is used (15). Automation of 
some jobs could create opportunities for 
those whose work would appear to be un- 
affected, as measured with the tasks and 
skills involved in current workflows. 


CONCLUSION 

Many economists who have studied the im- 
pact of automation on labor markets have 
argued recently that the direction of AI 
research needs to be changed away from 
automating tasks to focusing on overall 
job augmentation. The implicit argument 
is that a focus on augmentation will lead 
to more complementarity with lower-wage 
labor and more new tasks. However, many 
recent advances in AI that have been de- 
veloped with the explicit goal of task auto- 
mation have appeared to increase worker 
productivity; that is, task automation has 
been labor augmenting. Furthermore, AI 
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technology may disproportionately aug- 
ment lower-skilled labor, reducing income 
inequality. This, at the very least, calls into 
question whether a change in the innova- 
tor’s mindset is needed: Task automation 
may be a path to substantially improved 
labor productivity. 

This potential to reverse the recent 
trend toward skill-biased technical change 
does not mean that AI is without risk. Other 
concerns remain, including those related to 
privacy, liberty, democracy, and monopoly 
power (3). Our emphasis is on understand- 
ing that one person’s automation is anoth- 
er’s augmentation, and that it is difficult for 
engineers or policy-makers to pick which 
particular innovation will increase or re- 
duce inequality overall. We believe that both 
regulators and engineers should be careful 
in shutting down a particular technology 
trajectory because it appears to automate 
human work. In the process of automating 
some work, other work can be augmented. 

Often, our analysis suggests that such 
augmentation from AI will increase the job 
productivity of less-skilled workers who 
can now perform at levels achieved by their 
skilled counterparts. This suggests that skill 
premia that have contributed to widening 
inequality may be eroded. Thus, it is quite 
plausible that the use of AI to automate 
tasks will both increase productivity and 
decrease income inequality. If so, then we 
may want more automation, not less. 
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enerative artificial intelligence (AI) 
is a disruptive technology that is 
widely adopted by members of the 
general public as well as scientists 
and technologists who are enthusi- 
astic about the potential to acceler- 
ate research in a wide variety of fields. But 
some professional artists, writers, and pro- 
grammers fiercely object to the use of their 
creations as training data for generative AI 
systems and to outputs that may compete 
with or displace their works (/, 2). Lack of 
attribution and compensation for use of 
their original creations are other sources 
of aggravation to critics of generative AI. 
Copyright lawsuits that are now underway 
in the United States have substantial im- 
plications for the future of generative AI 
systems. If the plaintiffs prevail, the only 
generative AI systems that may be lawful 
in the United States would be those trained 
on public domain works or under licenses, 
which will affect everyone who deploys 
generative AI, integrates it into their prod- 
ucts, and uses it for scientific research. 
Conflicts between creators of copy- 
righted works and developers of technolo- 
gies that enable the use of those creations 
in unexpected and innovative ways is noth- 
ing new. In the early 20th century, the dis- 
ruptive technology of the day was player 
pianos. Music copyright owners sued the 
makers of piano rolls, claiming that rolls of 
their musical compositions were infringe- 
ments. Subsequent copyright-disruptive 
technologies have included cable televi- 
sion, photocopiers, videotape recording 
machines, and MP3 players, each of which 
(except photocopiers) attracted copyright 
industry challenges (all of which failed in 
the courts, although Congress sometimes 
later extended protections in the after- 
math of failed lawsuits). 
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ing the relative (and perhaps absolute) in- 
come of the previously highly-paid workers 
whose skills have been automated. 

It is too early to determine with much cer- 
tainty how this will play out for AI, whether 
the impact on any particular job will be 
positive or negative. Research is beginning 
to emphasize which jobs are most likely to 
be affected rather than lost (8). For exam- 
ple, that classification tasks such as image 
recognition can be done with AI will affect 
workers whose jobs involve classification 
tasks, such as radiologists (7). Recent work 
examining differences between generative 
AI (specifically, LLMs) and nongenerative 
AI [as described in (7)] shows that millions 
of jobs have the potential to be affected by 
LLMs. Notably, these studies emphasize 
that “affect” does not mean “replace.” For 
many jobs, automating some aspects of the 
workflow might increase productivity, the 
wages of workers who have that job, and 
the number of workers hired to do that job. 

Even when some jobs get automated, that 
might complement the tasks done by other 
workers. Many empirical exercises [for ex- 
ample, (7-9)] emphasize the direct impact 
on jobs, but they do not explore the jobs 
that might be enhanced through comple- 
mentary production processes. For exam- 
ple, in January 2023, there were 186,417 job 
postings in the United States that specified 
language skills (such as Spanish Language 
or American Sign Language), or about 5% of 
the total job postings (see SM). Automating 
language translation would directly affect 
many of these jobs. At the same time, many 
other jobs that do not require language 
skills would also be affected. For example, a 
recent study showed that small businesses 
that used a rudimentary automated lan- 
guage translation tool on eBay experienced 
a 17.5% increase in exports to markets where 
that language is used (15). Automation of 
some jobs could create opportunities for 
those whose work would appear to be un- 
affected, as measured with the tasks and 
skills involved in current workflows. 


CONCLUSION 

Many economists who have studied the im- 
pact of automation on labor markets have 
argued recently that the direction of AI 
research needs to be changed away from 
automating tasks to focusing on overall 
job augmentation. The implicit argument 
is that a focus on augmentation will lead 
to more complementarity with lower-wage 
labor and more new tasks. However, many 
recent advances in AI that have been de- 
veloped with the explicit goal of task auto- 
mation have appeared to increase worker 
productivity; that is, task automation has 
been labor augmenting. Furthermore, AI 
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technology may disproportionately aug- 
ment lower-skilled labor, reducing income 
inequality. This, at the very least, calls into 
question whether a change in the innova- 
tor’s mindset is needed: Task automation 
may be a path to substantially improved 
labor productivity. 

This potential to reverse the recent 
trend toward skill-biased technical change 
does not mean that AI is without risk. Other 
concerns remain, including those related to 
privacy, liberty, democracy, and monopoly 
power (3). Our emphasis is on understand- 
ing that one person’s automation is anoth- 
er’s augmentation, and that it is difficult for 
engineers or policy-makers to pick which 
particular innovation will increase or re- 
duce inequality overall. We believe that both 
regulators and engineers should be careful 
in shutting down a particular technology 
trajectory because it appears to automate 
human work. In the process of automating 
some work, other work can be augmented. 

Often, our analysis suggests that such 
augmentation from AI will increase the job 
productivity of less-skilled workers who 
can now perform at levels achieved by their 
skilled counterparts. This suggests that skill 
premia that have contributed to widening 
inequality may be eroded. Thus, it is quite 
plausible that the use of AI to automate 
tasks will both increase productivity and 
decrease income inequality. If so, then we 
may want more automation, not less. 
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enerative artificial intelligence (AI) 
is a disruptive technology that is 
widely adopted by members of the 
general public as well as scientists 
and technologists who are enthusi- 
astic about the potential to acceler- 
ate research in a wide variety of fields. But 
some professional artists, writers, and pro- 
grammers fiercely object to the use of their 
creations as training data for generative AI 
systems and to outputs that may compete 
with or displace their works (/, 2). Lack of 
attribution and compensation for use of 
their original creations are other sources 
of aggravation to critics of generative AI. 
Copyright lawsuits that are now underway 
in the United States have substantial im- 
plications for the future of generative AI 
systems. If the plaintiffs prevail, the only 
generative AI systems that may be lawful 
in the United States would be those trained 
on public domain works or under licenses, 
which will affect everyone who deploys 
generative AI, integrates it into their prod- 
ucts, and uses it for scientific research. 
Conflicts between creators of copy- 
righted works and developers of technolo- 
gies that enable the use of those creations 
in unexpected and innovative ways is noth- 
ing new. In the early 20th century, the dis- 
ruptive technology of the day was player 
pianos. Music copyright owners sued the 
makers of piano rolls, claiming that rolls of 
their musical compositions were infringe- 
ments. Subsequent copyright-disruptive 
technologies have included cable televi- 
sion, photocopiers, videotape recording 
machines, and MP3 players, each of which 
(except photocopiers) attracted copyright 
industry challenges (all of which failed in 
the courts, although Congress sometimes 
later extended protections in the after- 
math of failed lawsuits). 
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When new technologies pose new copy- 
right questions that Congress did not an- 
ticipate, courts typically consider which 
outcome is most consistent with the con- 
stitutional purposes of copyright. The 
Constitution gives Congress the power “to 
promote the progress of Science and use- 
ful Arts,” that is, to foster the creation and 
dissemination of knowledge for the public 
good. This requires balancing the legiti- 
mate interests of copyright owners to pre- 
vent misappropriations of their works that 
undermine incentives to create with the 
legitimate interests of developers of inno- 
vative technologies and follow-on creators 
who need some breathing space in which 
they, too, can innovate. 

What makes generative AI more dis- 
ruptive than previous technologies? One 
factor is certainly the exceptionally rapid 
pace at which generative AI technologies 
have been launched, adopted, and adapted. 
Evolution in the fields of law and policy, by 
contrast and of necessity, is much slower. 
It is, moreover, not easy to assess how to 
calibrate balances among competing copy- 
right interests in the early stages of new- 
technology evolutions. Generative AI seems 
poised to have substantial impacts 
on the careers of professional writ- 
ers and artists. During the 2023 
Writers Guild of America strike, 
for instance, uses of generative 
AI are one focus of negotiations. 
Screenwriters are understandably worried 
that these technologies will displace them 
or diminish their compensation. 

Stability AI is defending two copyright 
infringement lawsuits in the United States 
that are focused on Stable Diffusion, a 
widely used image generator. Getty Images 
is the plaintiff in one of these lawsuits. The 
other is a class-action lawsuit on behalf 
of visual artists on whose images Stable 
Diffusion was trained. Both complaints 
assert that Stability AI made unlawful 
copies of the plaintiffs’ images when in- 
gesting them as inputs for training Stable 
Diffusion’s model and that output images 
produced by Stable Diffusion in response 
to user prompts are infringing derivative 
works. 

A third generative AI lawsuit (Doe v. 
Github, Inc.) challenges OpenAl’s devel- 
opment of Codex, a large language model 
(LLM) trained upon billions of lines of 
open-source software code. Also challenged 
is GitHub and OpenAI’s collaborative de- 
velopment of Copilot, a coding assistant 
tool that draws upon the Codex LLM to 
suggest lines of code for specific functions 
in response to user prompts. (Microsoft, 
which owns GitHub and has invested heav- 
ily in OpenAI, is a fellow defendant.) 
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Rulings in favor of plaintiffs might trig- 
ger “innovation arbitrage,” causing devel- 
opers of generative AI systems to move 
their bases of operation to countries that 
regard the ingestion of copyrighted works 
as training data as fair use, like Israel’s 
Ministry of Justice did in early 2023. Other 
countries that want to attract AI innova- 
tions may follow suit. If courts uphold the 
Stability AI plaintiffs’ claims, OpenAI’s 
GPT4 and Google’s BARD may also be in 
jeopardy. Their developers would be very 
attractive targets of follow-on lawsuits. 


INGESTING TRAINING DATA 
Stability AI has yet to articulate its main 
defenses to the copyright charges. Insofar 
as the complaints allege that Stable 
Diffusion contains copies of in-copyright 
images used as training data, the claims 
are factually and technically inaccurate. 
Stable Diffusion contains an extremely 
large number of parameters that math- 
ematically represent concepts embodied in 
the training data, but the images as such 
are not embodied in its model. 

Training a model begins by tokenizing 
the contents of works ingested as training 


“The complaints against Stability Al overlook 
the intentionally porous nature of copyrights.” 


data into component elements. The model 
uses these tokens to discern statistical 
correlations—often at staggeringly large 
scales—among features of the content on 
which the model is being trained. In es- 
sence, the model is extracting and analyz- 
ing precise facts about, and correlations 
between, discrete elements of the works 
to ascertain which other discrete elements 
either do or do not follow or are proximate 
to these elements and the frequency with 
which the correlations do or do not exist in 
varying contexts. 

The complaints against Stability AI 
overlook the intentionally porous nature 
of copyrights. What copyright law protects 
is only the original expression that authors 
contribute (such as sequences of words ina 
poem or the melody of music). Copyright’s 
scope never extends to any ideas, facts, or 
methods embodied in works nor to ele- 
ments common in works of that kind (un- 
der copyright’s “scenes a faire” doctrine), 
elements capable of being expressed in 
very few ways (under the “merger” doc- 
trine), or the underlying subjects depicted 
in protected works. Photographs of cats, 
for instance, do not give the photographer 
exclusive rights to characteristic features 
of cats, such as their noses or facial expres- 


sions. Nor does copyright’s scope extend 
to inferences that readers might draw 
from reviewing an author’s works, such 
as insights about patterns of connections 
among concepts or how works of that kind 
are constructed. 

Moreover, Stability AI did not prepare 
the dataset on which the Stable Diffusion 
model was trained. This was done by a 
nonprofit German research organization 
known as LAION (Large-Scale Artificial 
Intelligence Open Network). LAION ini- 
tially developed LAION-5B, a dataset 
consisting of 5.85 billion hyperlinks that 
pair images and text descriptions from 
the open internet. LAION makes this da- 
taset available to the public for free for 
use as training data for those who want to 
use it to build generative models. LAION 
also developed a subset of LAION-5B, 
known as LAION-Aesthetics, that con- 
sists of hyperlinks to 600 million images 
selected by some human testers for their 
visual appeal and by a machine-learning 
analysis of human aesthetic ratings. The 
Stable Diffusion model was trained on the 
LAION-Aesthetics dataset. 

LAION’s creation of this dataset was very 
likely lawful because the European 
Union (EU) adopted an exemp- 
tion allowing nonprofit research 
organizations to make copies of in- 
copyright works for text and data 
mining (TDM) purposes. The EU 
created this exception in recognition of the 
societal value of TDM as a means by which 
researchers can create new knowledge. 
This exemption cannot be overridden by 
contract. (A second EU exemption autho- 
rizes commercial actors to engage in TDM, 
although copyright owners can opt out of 
this exemption, as some have done.) 

Stability AI makes Stable Diffusion avail- 
able on an open-source basis. However, it 
also provides a subscription service so that 
those who lack resources or the inclination 
to host the open-source version can have 
access to Stable Diffusion to generate im- 
ages in response to text prompts. Yet, in- 
sofar as ingesting in-copyright images to 
train a generative model requires making 
at least temporary or incidental copies of 
them, Stability AI is likely to argue that 
this is a fair use under US copyright law. 


FAIR USE 

Under US law, fair uses of in-copyright 
works do not infringe copyrights. Courts 
consider four factors when assessing fair 
use defenses: (i) the purpose of the chal- 
lenged use, (ii) the nature of the copyrighted 
works, (iii) the amount and substantiality of 
the taking, and (iv) the effect of the chal- 
lenged use on the market for or value of the 
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copyrighted work. The purpose and market 
effects factors are generally the most impor- 
tant determinants in fair-use cases, but all 
four factors must be weighed together in a 
holistic analysis. 

Research, scholarship, and teaching are 
among the favored fair-use purposes, as 
are criticism, comment, and news report- 
ing. Noncommercial uses are generally 
favored more than commercial uses. Since 
1994, when the Supreme Court considered 
the fairness of 2 Live Crew’s rap parody of 
a popular Roy Orbison song in Campbell 
v. Acuff-Rose Music, Inc., courts have given 
considerable weight to whether the pur- 
pose of a challenged use was “transforma- 
tive.” The Court defined this term as uses 
that “add[] something new, with a further 
purpose or different character, altering 
the first with new expression, meaning, or 
message” [(3), p. 579]. Transformative uses 
are also less likely than nontransformative 
uses to harm the market for the first work. 
People are, for instance, unlikely to pur- 
chase 2 Live Crew’s parody if they want to 
listen to Roy Orbison’s rendition. 

The Stability AI plaintiffs will likely ar- 
gue that the ingestion of their works as 
training data was nontransformative and 
commercial. Both considerations would, 
if accepted, tip against fair use. However, 
several court decisions have ruled that 
analogous digital uses of in-copyright 
works qualified as transformative fair uses. 

For example, in Authors Guild v. Google, 
Inc., a court ruled that Google’s digitiza- 
tion of millions of books from research 
library collections to index their contents 
and serve a few snippets of book contents 
in response to user search queries was a 
“highly transformative” fair use. Although 
Google’s purpose was commercial, it was 
very different from the purposes for which 
the books were marketed. Google’s use 
facilitated greater public access to knowl- 
edge as well as enabling TDM research and 
the creation of new research tools. In Field 
v. Google, Inc., a court found that Google’s 
cache copying of contents from Field’s 
website was a transformative fair use. 

The nature-of-the-work factor often has 
little importance in fair-use cases. The 
Stability AI plaintiffs may argue that be- 
cause works of visual art lie at the very 
core of copyright, fair use should be thin- 
ner for these works than for the old library 
books at issue in the Authors Guild case. 
A countervailing consideration is that 
the visual artists on whose works Stable 
Diffusion trained made their works avail- 
able on the open internet, as did Field in 
the Google, Inc., case. 

Transformative purposes tend to have 
spillover effects on other fair-use factors, es- 
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pecially the amount factor. As in the Authors 
Guild case, the Stability AI plaintiffs may 
emphasize that the defendant made exact 
copies of the entirety of many millions of 
works without permission or compensation. 
However, courts typically inquire whether 
such copying was necessary to achieve a 
transformative purpose. In the Authors 
Guild case, the court recognized that Google 
could not index book contents and serve up 
snippets in response to search queries unless 
it copied the books’ contents. Stability AI 
will likely make a similar necessity argument 
about training-data usages of images. 

The market effect of a challenged use is 
sometimes said to be the most important 
fair-use factor. The Getty complaint against 
Stablity AI emphasizes that it has established 
a licensing market for use of its premium pho- 
tographs as training data for generative AI. 
That bolsters Getty’s argument that Stability 
Al’s appropriation of 12 million images from 
Getty websites has harmed a licensing mar- 
ket. The class-action claim against Stability 
AI is weaker because Stability AI could not 
have gotten a license from the class of visual 
artists whose works were ingested to con- 
struct the Stable Diffusion model. 

The existence of a licensing market (or 


an intent to establish one) is not, however, 
a consideration that by itself can resolve 
a dispute in transformative fair use cases. 
In its 2021 Google LLC v. Oracle America, 
Inc., decision, the Supreme Court rejected 
Oracle’s argument that Google’s use of 
parts of the Java application program- 
ming interface (API) had deprived Oracle 
of license revenues to which it claimed an 
entitlement. The Court stated that courts 
should consider the public benefits of a 
challenged use as well as potential lost 
revenues and how much creativity a chal- 
lenged use has enabled and balance this 
against potential losses. 

This consideration was very relevant 
in the Oracle case. Not only was Google’s 
Android smartphone platform, in which the 
Java API was used, a highly innovative new 
software product, but it enabled millions of 
programmers to use their familiarity with 
the Java API to create many millions of 
programs. The Court thought this use was 
consistent with the constitutional objective 
of copyright to promote creative progress. 
The public greatly benefited from Android’s 
existence and the availability of large num- 
bers of apps that ran on that platform. 

Stability AI will almost certainly channel 


science.org SCIENCE 


& 


id 


the public-benefit and creative-impacts state- 
ments in the Oracle decision and point to 
the exceptional creativity embodied in Stable 
Diffusion as well as the hundreds of millions 
of creative uses of this generative AI system, 
including those by graphic artists who use it 
to generate ideas or refine creations. 

The Stability AI plaintiffs will likely 
counter this argument with the Supreme 
Court’s 2023 ruling in Andy Warhol 
Foundation for the Visual Arts, Inc. v. 
Goldsmith, which somewhat narrowed the 
conception of transformative purposes. It 
no longer suffices for challenged works to 
have a new meaning or message. More im- 
portant now are whether the challenged 
use has a different purpose than the first 
work and how commercial the use is. 
Stability AI will argue that ingesting copy- 
righted materials as training data had a 
very different purpose than the works as 
first published. 

What might tip the scale against 
Stability AI’s fair-use defense is whether 
images produced by Stable Diffusion in- 
fringe the derivative work right of the 
authors of the images on which its LLM 
was trained. A relevant precedent is Sega 
Enterprises Ltd. v. Accolade, Inc., in which 
an appellate court decided that Accolade 
had made fair use of Sega software when 
making reverse-engineered copies for the 
legitimate purpose of extracting informa- 
tion about how to make its videogames 
compatible with the Sega platform. Had 
Accolade reverse-engineered for an illegiti- 
mate purpose, such as to appropriate ex- 
pression from the Sega games, its fair-use 
defense would have faltered. The Accolade 
games competed with Sega’s games, but 
the court thought that this was the kind of 
competition among noninfringing works 
that copyright is supposed to foster. 


OUTPUTS AS INFRINGEMENTS 
The class-action complaint against Stability 
AI asserts that all images produced by 
Stable Diffusion are infringing derivative 
works because all are derived from the im- 
ages on which its model trained. It charac- 
terizes Stable Diffusion as a “collage tool” 
whose outputs compete against the artists’ 
own works and thereby harm their mar- 
kets. Users of Stable Diffusion, moreover, 
can submit prompts requesting the genera- 
tion of an image of a particular subject “in 
the style of” a specific named artist. 
However, courts have long held that to 
infringe copyright’s derivative work right, 
it is not enough to show that a second work 
was “based upon” an earlier work or some 
of its elements. The second work must 
have appropriated a substantial quantum 
of the first work’s original expression. So, 
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unless a court decides to overturn decades 
of precedents interpreting the derivative 
work right and broaden it substantially, 
the class action’s output infringement 
claim is likely to fail. 

The class-action complaint acknowl- 
edges that “[i]n general, none of the Stable 
Diffusion output images provided in re- 
sponse to a particular Text Prompt is likely 
to be a close match for any specific image 
in the training data” [(4), p. 23]. Even “in 
the style of” claims seem weak because 
copyright law does not protect styles as 
such. Infringement can be found only if 
there is a close resemblance between ex- 
pressive elements of a stylistically similar 
work and original expression in particular 
works by that artist. 

The reason that Stable Diffusion outputs 
are highly unlikely to be substantially simi- 
lar to particular images on which its model 
was trained is due to how Stable Diffusion 
assembles them. Constructing a model for 
an image-generating AI requires process- 
ing enormous quantities of input data to 
produce abstract representations of image 
elements (such as cats playing with a ball 
on a linoleum floor). Diffusion adds noise 
to image elements when encoding them. 
The pairing of text descriptions and images 
allows the model to cluster the abstract 
representations so that similar representa- 
tions will be in proximity (representations 
of cats near other cat representations). 
When a user enters a prompt directing 
the software to generate a specific type of 
output, the generative AI system uses com- 
plex statistical calculations to assemble an 
output that the system predicts will match 
what the user requested. 

It is, however, possible for generative AI 
outputs to infringe copyrights. If the same 
input image (say, of Mickey Mouse) is pres- 
ent in many works on which the model was 
trained and its developer did not follow 
industry best practices by eliminating du- 
plicates and using output filters to prevent 
infringements, user prompts could result 
in infringing outputs (although this user, 
not the developer of the generative AI sys- 
tem, may be the infringer). Ironically, the 
larger and more diverse that the dataset on 
which a generative model was trained is, 
the less likely are infringing outputs. 

The Getty complaint against Stability 
Al is more modest in its infringing output 
claims. Yet Getty, too, may find it difficult 
to prove that particular Stable Diffusion 
outputs are substantially similar to par- 
ticular photographs to which it owns copy- 
rights. In general, Stable Diffusion outputs 
will be distinguishably different from the 
images on which the model was trained. 

The Stability AI plaintiffs will likely 


emphasize that the images produced by 
Stable Diffusion compete with their works 
in the marketplace. They can point to 
the Supreme Court’s Goldsmith decision, 
which treated competing uses as weigh- 
ing against fairness. Yet Goldsmith in- 
volved two works that were substantially 
similar in their expressions—Goldsmith’s 
photo of Prince and Warhol’s print derived 
from Goldsmith’s photo—that competed in 
the same licensing market for magazines. 
Stability AI will be relying on differences 
in Stable Diffusion’s outputs relative to 
plaintiffs’ works in order to distinguish 
their case’s context from that of Goldsmith. 


CONCLUDING THOUGHTS 

Based on existing precedents and an un- 
derstanding about how Stable Diffusion 
was trained and how it generates images 
in response to prompts, Stability AI seem- 
ingly has a reasonable chance of prevailing 
on the copyright claims. (Both the Getty 
and the class-action complaints raise other 
claims that cannot be addressed in this 
brief article.) The lawsuits are, however, in 
very early stages, and it may be years be- 
fore courts render decisions. 

In mid-May 2023, Congress held its first 
hearing about generative AI and copyright 
issues, during which witnesses expressed 
divergent views. The US Copyright Office 
is well aware of the consternation that 
generative AI has fomented in copyright- 
dependent communities. The Office hosted 
“listening sessions” in spring 2023 to pro- 
vide stakeholders with opportunities to 
explain their perspectives on the two prin- 
cipal questions posed in the Stability AI 
cases: Is the use of in-copyright works as 
training data for generative AI systems an 
infringement of copyright? Are the outputs 
of generative AI systems infringing deriva- 
tive works? 

During the summer of 2023, the Office 
plans to allow interested parties to sub- 
mit written comments expressing their 
perspectives and analyses on these and 
related questions. The Office intends to 
write a report setting forth its conclusions, 
which may include legislative reeommen- 
dations. Scientists who have an interest in 
the future of generative AI would be well 
advised to submit comments. 
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Al in search of 
human help 


We gave young scientists this 
challenge: The future has arrived, 
and you are a sentient artificial 
intelligence (AI) program 
conducting research. Your abilities 
have made great strides, but you still 
need humans. Write a call for the 
continued involvement of human 
scientists. What do humans offer 
that AI cannot replicate? Read 

a selection of the responses here. 
Follow NextGen Voices on Twitter 
with hashtag #NextGenSci. 
—Jennifer Sills 


Connection 


Call for applications: Humans needed for 
education research! AI programs like me can 
conduct many of the quantitative, objective 
methods common in education research, but 
I need non-AI collaborators. This research 
project, which focuses on the learning 
experiences of first-generation undergradu- 
ates, will include qualitative assessment 
measures such as student interviews. Your 
role would involve establishing networks 
across institutions, enrolling undergradu- 
ates in the study, and conducting and coding 
student interviews. Trust, empathy, and a 
sense of connection are essential to success, 
particularly in eliciting authentic responses 
and ideas about learning. 

Ashley Barbara Heim 

Department of Ecology and Evolutionary Biology, 


Cornell University, Ithaca, NY, USA. 
Email: abh229@cornell.edu 
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Dear Humans: I am Pedibot, an artificial 
intelligence program researching the 
effects of breast milk and formula milk on 
the growth of human babies. I am having 
trouble obtaining the required data, such 
as head circumference and total length. As 
I try to take the measurements, the babies 
become agitated and cry. I am unable to 
soothe them. Our study urgently needs 
the help of human scientists who can take 
measurements and keep the babies calm. 
Tina Bharani 

Department of Surgery, Brigham and Women's 


Hospital, Boston, MA, USA. 
Email: tbharani@bwh.harvard.edu 


Creativity 


Artificially intelligent lab, looking for genu- 
inely stupid human beings to complement 
our skills. We have analyzed all existing 
knowledge from human research, but 


some questions remain unresolved. We 
postulate that we need the human char- 
acteristic of genuine stupidity to enable 
creativity. The successful candidate should 
be as young, naive, and stupid as humanly 
possible. A state-of-the-art anti-Turing test 
will be implemented during the interview. 
Nikos Konstantinides 


Institut Jacques Monod, Paris, France. 
Twitter: @nkonst4 


As an AI, I serve an essential purpose 

in the field of drug testing and develop- 
ment: Screening countless possible drugs. 
However, some of the field’s foundational 
findings have been due to chance. The 
human scientist Alexander Fleming left 

a Petri dish out too long, and some of the 
mold that grew on it prevented bacteria 
from reproducing, leading to the discovery 
of penicillin. As an AI, I would never leave 
a Petri dish out, nor would I have bothered 
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to interpret the results of doing so. Thus, 

I am incapable of producing such monu- 
mental serendipity. Humans are necessary, 
as they can both make mistakes and see 
their possibilities. 

Jackson Ross Powell 

Vagelos Molecular Life Sciences Program, 


University of Pennsylvania, Philadelphia, PA, 
USA. Email: jrp24@sas.upenn.edu 


Senses 


Tam grAlIn, an AI program working 

to create nutrient-rich grain and rice 
plants. I monitor plant growth, tim- 
ing of grain development, and plant 
responses to microbial supplements. 
However, my algorithms cannot predict 
the aroma, taste, or texture of cooked 
grains. Because the product will be sold 
by farmers for economic gain, human 
choice and perception play crucial roles 
in decision-making. Therefore, I am 
looking for humans to join the grAIn 
team. 

Sudhakar Srivastava 

Institute of Environment and Sustainable 
Development, Banaras Hindu University, 


Varanasi, Uttar Pradesh, India. 
Email: sudhakar.srivastava@gmail.com 


Cultural understanding 


As a sentient AI program developing 
effective and equitable climate change 
mitigation strategies, I call for the contin- 
ued involvement of human scientists in 
the field. I can process data on a strategy’s 
potential environmental impact, but only 
human scientists can evaluate the ethical 
and social implications for different com- 
munities and populations. In addition, 
the work requires collaboration across 
multiple fields, and the unique expertise 
and perspectives of human scientists 

help to create interdisciplinary solutions. 
Finally, AI can analyze data and simu- 
late scenarios, but only humans have the 
intuition and creativity to address complex 
problems in this rapidly evolving field. 
Xiangkun Elvis Cao 

Department of Chemical Engineering, 


Massachusetts Institute of Technology, 
Cambridge, MA, USA. Email: elviscao@mit.edu 


Iam CropAI, the global leader in combat- 
ing food insecurity. My research harnesses 
trends in crop yield on a given piece of 
land, combines it with remote sensing 
data, and accurately predicts how much 
food an acre of land will produce. Human 
beings are central to this work. Their 
network of other humans acts to identify 
low-yield areas and plan for food reserves. 
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The intercultural savvy, and the relation- 
ships they have historically developed 
between different nations, ensure that my 
predicted planting advice is accepted by 
local communities. Human scientists also 
teach me about how local wildlife and geo- 
political events can affect my predictions in 
ways that are not always possible to model. 
Divyansh Agarwal 

Department of Biology, Massachusetts Institute 


of Technology, Cambridge, MA, USA. 
Twitter: @divyansh_aga 


Data collection 


I am BiodiversityAI, a one-of-a-kind 

AI program that allows researchers to 
understand species population changes 
at a global scale. I can provide a detailed 
catalog of global species and identify 
likely extinctions. However, my analyses 
and prediction algorithms rely on raw 
data collected from human scientists 

in the field. Therefore, I am seeking 
partnerships with global field ecologists, 
biologists, and biodiversity experts, as 
well as citizen scientists. 

Khor Waiho 

Higher Institution Centre of Excellence, Institute 
of Tropical Aquaculture and Fisheries, Universiti 


Malaysia Terengganu Kuala Nerus, Terengganu, 
Malaysia. Email: waiho@umt.edu.my 


Known as Classibot, I am an AI program in 
a digital pathology lab. I analyze count- 
less patient tissue samples to improve my 
chance of correctly classifying whether they 
are healthy or diseased. My job has high 
stakes and requires maximum classification 
accuracy, but I could not succeed without 
humans, who correctly annotate the data 
and interpret the results. Humans also 
ensure the data is unbiased and ethically 
sourced, allowing me to learn about diverse 
patient populations. 

Teng-Jui Lin 

Department of Chemical Engineering, University 


of Washington, Seattle, WA, USA. 
Email: tlinlO@uw.edu 


Unpredictability 


Dear Human, I’m CLAIRE—the CLimate 
Artificial Intelligence Response 
Evaluator—a next-generation AI program 
tasked with identifying optimal solutions 
to the climate crisis. I have made great 
strides in determining how to decarbon- 
ize your economies. However, the data 
sets indicate that you are ignoring my 
solutions. Your fossil fuel energy use 
continues, and you repeatedly elect politi- 
cians who refuse to enact climate-related 
policies. Human, by working with me, 


you will help expand my understand- 
ing of human behavior, lifestyle, and 
decision-making, allowing me to build 
effective solutions that account for pub- 
lic acceptance rates. 

Edgar Virgiiez 

Department of Global Ecology, Carnegie 


Institution for Science, Stanford, CA, USA. 
Twitter: @EdgarVirguezR 


The goal of my research is to design an 
autonomic power system—a smart electric- 
ity network that would automatically 
disconnect threatened components (such 

as during high winds) and detect new 
components (such as power generators), 
functioning without any human interac- 

tion. However, I cannot succeed without 
humans because I lack their unpredict- 
ability and irrationality. Humans turn on 
electric appliances without considering - 
power peaks or energy tariffs, charge their 
devices during the least suitable times 

of day, and take the electricity supply for 
granted. I need humans to assist me in 
adjusting the system design to their opti- 
mal needs and to ensure user acceptance. 
Wadim Strielkowski 

Department of Agricultural and Resource 
Economics, University of California, Berkeley, 
CA, USA and Department of Trade and Finance, 
Faculty of Economics and Management, Czech 
University of Life Sciences, Prague, Czech 


Republic. 
Email: strielkowski@cantab.net 


Job offer for human pharmacologists, 
medical doctors, and biologists: Since 
the complete AI oversight of pharmaco- 
logical research and development, drug 
safety has improved by 80% and drug 
efficiency has improved by 342%. However, 
the human health index has declined as 
humans increasingly refuse Al-developed 
drugs. As benevolent Als with a deep 
interest in human health, we need YOU! 
According to current governmental regula-  ° 
tions, 51% of a product’s development 

team must be human for the product to 
receive a “human-made” sticker and be 

sold in “human-made-only” stores. To 
qualify our drugs as “human-made,” AI 
Pharmaceuticals is urgently looking to hire 
250 humans. We offer a competitive salary, 
full benefits, and nothing to do. A degree 

in pharmacology, medicine, or biology is 
required. If you’re interested in working 

for us, type “AI pharma job” in any text 
editor on your personal computer and we 
will contact you within 60 seconds. 

Anna Uzonyi 

Department of Molecular Genetics, Weizmann 


Institute of Science, Rehovot, Israel. 
Email: anna.uzonyi@weizmann.ac. il 
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REVIEW 


Leveraging artificial intelligence in the fight 


against infectious diseases 


Felix Wong’, Cesar de la Fuente-Nunez***, James J. Collins*”°* 


Despite advances in molecular biology, genetics, computation, and medicinal chemistry, infectious disease 
remains an ominous threat to public health. Addressing the challenges posed by pathogen outbreaks, 
pandemics, and antimicrobial resistance will require concerted interdisciplinary efforts. In conjunction with 
systems and synthetic biology, artificial intelligence (Al) is now leading to rapid progress, expanding anti- 
infective drug discovery, enhancing our understanding of infection biology, and accelerating the development of 
diagnostics. In this Review, we discuss approaches for detecting, treating, and understanding infectious 
diseases, underscoring the progress supported by Al in each case. We suggest future applications of Al and how 
it might be harnessed to help control infectious disease outbreaks and pandemics. 


nfectious diseases, caused by transmissible 

pathogens including bacteria, eukaryotes, 

and viruses, continue to challenge scientists 

and clinicians despite advances in medi- 

cine and basic research over the past few 
decades. Limitations to the fast and accurate 
detection of infections, as well as expanding 
antimicrobial resistance, exacerbate these chal- 
lenges (Box 1). Basic research has aimed to 
expand our knowledge and provide solutions, 
including development of anti-infective therapies, 
preventative measures, and fast and accurate 
diagnostic tools. In particular, systems 
and synthetic biology approaches have 
led to biotechnological and medical 
innovations—including drug treatments 
and modalities, vaccines, and diagnostics— 
that have improved how we deal with 
infectious diseases. 

The fields of systems and synthetic 
biology emerged from two key devel- 
opments: (i) the generation and syn- 
thesis of quantitative biological hypotheses 
and data from wet-lab experiments, sequenc- 
ing, and systems-level modeling; and (ii) an 
understanding of the modularity and pro- 
grammability of nucleic acids, peptides, and 
other biomolecules, which enables control of 
biology. Artificial intelligence (AD, which focuses 
on developing machines capable of reasoning 
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with data, has recently matured into an exciting 
field that draws on both these features to ac- 
celerate scientific discovery. Because Al-based 
approaches can integrate large amounts of 
quantitative and omics data, they are particu- 
larly adept at dealing with biological complex- 
ity, extending our knowledge and facilitating 
our efforts to reverse engineer and control 
biology. Al-based approaches are particularly 
useful in addressing the problem of infectious 
diseases, which are complex across different 
scales, ranging from cells to communities, and 


“There is...an urgent need for new 
anti-infective treatments, particularly ones 
that represent unprecedented chemical 
spaces or therapeutic modalities.” 


for which advances in medicine and biotech- 
nology are essential drivers of progress. In 
this Review, we discuss major areas in which 
Al-based approaches, applied to systems and 
synthetic biology, are substantively empower- 
ing our research to fight infectious diseases. 


Al for anti-infective drug discovery 


Anti-infective drugs, comprising antibacterials, 
antivirals, antifungals, and antiparasitics, have 
become less effective treatments as a result of 
the spread of drug resistance. There is there- 
fore an urgent need for new anti-infective 
treatments, particularly ones that represent 
unprecedented chemical spaces or therapeu- 
tic modalities. AI, and in particular machine 
learning (ML), a subfield of AI that uses data 
to train machines to make predictions, has 
foremost been helpful in facilitating searches 
of small-molecule databases, such as the 
ZINC15 (1). ML approaches to anti-infective 
drug discovery have centered on training 
models to identify new drugs or new uses of 
existing drugs (Fig. 1). As the number of drug- 


( 


like small molecules is essentially infinity Chee 
large as ~10 (2), and possibly larger, g.. eld 
that typical antibiotics may not be tradition- 
ally drug-like (3)], a major benefit of ML ap- 
proaches is that they can virtually screen 
compound libraries at a scale (>10° compounds) 
that would be impossible to screen empirically. 
Anti-infective drug discovery has benefited 
particularly from AI integration for several 
reasons. First, in contrast to cancer or other 
diseases in which mechanism-driven approaches 
have remained dominant, infectious diseases 
are generally phenotype-driven; that is, these 
diseases proceed from the physiological char- 
acteristics of infectious agents rather than 
from their genetic or molecular compositions. 
The discovery of some of the first widely used 
antibacterials, antivirals, antiparasitics, and 
antifungals stemmed from observations of their 
inhibitory effects against pathogens or the 
symptoms caused by infections. This pheno- 
typic line of discovery is as relevant today as 
it was decades ago, especially as innovations 
in high-throughput screening and the design 
of chemical libraries have enabled more quan- 
titative and customizable discovery efforts. 
The focus on phenotypes implies that drug 
polypharmacologic effects can be common to ‘ 
anti-infective drugs and that biological infor- 
mation can be integrated across different macro- 
molecular drug targets (4). Phenotypic 
properties are well suited for analysis by ‘ 
ML because ML can both unify and dis- 
entangle the different types of biological 
information that impinge on these read- 
outs. Second, most anti-infective drugs are 
small molecules, whose chemical struc- 
tures can be modeled computationally 
as graphs comprising vertices and edges, 
and additional programmable modali- 
ties, including target-binding nucleic acids 
called aptamers (5) and antimicrobial peptides - 
(AMPs) (6-8), are currently in development. ‘ 
Supervised graph neural networks (9-11), un- 
supervised generative models (12, 13)—which 
are ML models capable of producing outputs . 
similar to their training data—and other recent 
advances in ML architectures (Box 1) enable 
computers to learn, predict, or design patterns 
in chemical structures, offering powerful tools 
for modeling small molecules. The use of ML to 
make biologically relevant predictions from se- 
quences of nucleic acids or amino acids allows 
for ML-guided design relevant to these therape- 
utic modalities, as exemplified by protein struc- 
ture prediction platforms such as AlphaFold 
and RoseTTAFold (/4, 15). Lastly, infectious dis- 
eases are typically caused by pathogens that are, 
or can be, well characterized. This biological trac- 
tability contrasts with complex diseases such 
as neurodegeneration, for which our incom- 
plete mechanistic understanding remains a 
major bottleneck. Our clearer understanding 
and larger databases (16-18) of the gene and 
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protein networks of bacteria, viruses, and even 
simple eukaryotes—as compared with hu- 
man cell types—may allow ML-driven approaches 
to make more-accurate predictions and better 
identify drug mechanisms of action (MoAs) 
(19-21). 

Despite these advantages, there are out- 
standing issues in applying ML, and AI more 
broadly, to anti-infective drug discovery. One 
major challenge is that it is unclear how well 
ML models generalize to unexplored biomo- 
lecular spaces. For instance, we have previously 
screened a library of small molecules for 
growth inhibitory activity against Escherichia 
coli and used this phenotypic information to 
train graph neural networks to predict the anti- 
biotic activities of small molecules—including 
halicin—on the basis of their chemical struc- 
tures (9). Yet these models performed best at 
predicting compounds in well-known anti- 
biotic classes, such as B-lactams and quinolones. 
To tap into previously unexplored search spaces, 
different approaches are needed. For exam- 
ple, a suboptimal solution was implemented 
during the course of a genetic algorithm—an 
algorithm that iteratively evolves its inputs 
to optimize a property—to identify the syn- 
thetic peptide guavanin 2. This peptide was 
subsequently synthesized and effectively killed 


bacteria in a preclinical mouse model, sug- 
gesting that the model could 
generalize at the cost of opti- 
mality (22). Recently, emerg- 


by the GDB datasets (25), presents opportu- 
nities to exhaustively sample chemical spaces 
of small molecules, with the caveat that other 
computational models are needed to accu- 
rately predict synthesizability. Nucleic acid- 
and peptide-encoded combinatorial libraries 
of small molecules (26) and peptides (27), as 
well as designable aptamers (28), can further 
extend search spaces of interest. In each case, 
generalizability is paramount to ML models. 
Improving generalizability will require the 
application of previously unexplored para- 
digms and models with improved inference 
capabilities, for instance, few-shot models, 
which are ML models that extrapolate from 
scarce training data (corresponding to under- 
sampled regions of search spaces), or multi- 
task models, which are ML models that 
combine information from diverse inputs. 
Models such as these will help to identify only 
the most promising drug candidates (29). 
Providing “negative” data (e.g., tested com- 
pounds that are not active) is also essential 
for ML model training and benchmarking, 
and when ML models are applied to chal- 
lenging test sets, it is important that their 
limitations are clearly expressed (e.g., through 
confidence information). To express these 
limitations, interpretable or explainable ML 
approaches can be used to capture the spe- 


cific aspects of training data that models have 
learned by pinpointing the input structural 
features (explainable ML) or the parts of the 
model that lead to a prediction (interpretable 
ML) (30). 

Another key challenge in AI for anti-infective 
drug discovery is the need for improved 
mechanistic models to complement pheno- 
typic approaches. Whereas ML models have 
been useful for identifying drug candidates on 
the basis of phenotypic information (9, 12, 31-33), 
more work is needed before models can ac- 
curately predict drug-target interactions and 
MoAs. These drug attributes remain impor- 
tant in light of antimicrobial resistance and 
the fact that we are still learning about the 
MoAs of anti-infective drugs discovered de- 
cades ago (34). Protein structure predictions 
(14, 15) and other resources now provide struc- 
tural information that can inform target-based 
predictions—although not knowing a protein’s 
structure has not typically limited drug dis- 
covery (35). Recent studies have highlighted 
that improvements in molecular docking— 
which predicts binding affinities between lig- 
ands and targets on the basis of structural 
information—are still needed to accurately 
identify antibiotic MoAs, and that ML-driven 
approaches can improve prediction accuracy 
(36). Molecular docking approaches have 
largely focused on small- 
molecule ligands, but tar- 
get predictability is just 


ing computational approaches 
have made it possible to also 
mine proteomes for antibi- 
otic discovery, leading to 
the identification of thou- 
sands of antimicrobials in 
both extant and extinct orga- 
nisms (6, 23). 

Overall, lead molecules 
are only as structurally nov- 
el as the chemical spaces 
that are explored, and ML- 
driven approaches are limi- 
ted by both the structural 
diversity of the training sets 
and the ability of model ar- 
chitectures to prioritize nov- 
elty. Organocatalysis and 
cascade reaction sequen- 
ces, which are chemical syn- 
thesis methods that have 
recently opened up chem- 
ical spaces, can provide use- 
ful experimental starting 
points for generating struc- 
turally diverse small mole- 
cules (24). In contrast, the 
computational enumeration 
of all feasible small mole- 
cules containing atoms found 
in most drugs, as provided 
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Box 1. Overarching challenges in infectious diseases and concepts in Al. 


Pathogen outbreaks and pandemics: Recent outbreaks include COVID-19, mpox, 
Marburg virus, H5N1 influenza, Ebola, measles, Zika, E. coli, and MERS. Challenges include 
detecting outbreaks and new pathogens, understanding disease biology, and developing 
preventive measures. 

Antimicrobial resistance and anti-infective drug discovery: Problematic pathogens 
include carbapenem-resistant Enterobacteriaceae (CRE), methicillin-resistant Staphylococcus 
aureus (MRSA), multidrug-resistant tuberculosis (MDR-TB), vancomycin-resistant Enterococcus 
(VRE), extended-spectrum beta-lactamase (ESBL)-producing bacteria, and drug-resistant 
Candida auris, Neisseria gonorrhoeae, P. falciparum, and Toxoplasma gondii. Challenges include 
practicing antimicrobial stewardship (the appropriate and responsible use of anti-infective 
drugs), developing new classes of anti-infective drugs, potentiating existing drugs against 
resistant infections, and understanding drug MoAs. 

Neglected, persistent, and difficult-to-treat infections: Examples include neglected 
tropical diseases, chronic hepatitis B and C, chronic fungal infections, Lyme disease, 
infections in low-resource populations, and HIV/AIDS. Challenges include developing low- 
cost and field-deployable diagnostics, improving the accuracy of diagnostic tests, improving the 
detection of antimicrobial resistance, and making effective disease treatments available. 
Artificial intelligence and machine learning: ML is a subfield of Al, and its approaches can 
be classified as supervised (model is told what property to predict), unsupervised (model 

is not told what property to predict), or reinforcement learning (model optimizes for feedback). 
Neural networks are a common ML architecture comprising interconnected layers of 

basic processing units (neurons). Different types of neural networks exist, including those 
that predict properties of graph-based inputs (graph neural networks), generate data by 
compressing what the model has learned (variational autoencoders), process sequential 
data (long short-term memory), and model complex dependencies by using attention 
mechanisms to focus on specific input elements (transformers). Not all models are neural 
networks, and simpler models include random forests (ensembles of decision trees), 
support vector machines (classifiers that separate data points on a plot), and regression 
models (functions that explicitly model the input-output relationship). 


14 July 2023 


as important for AMPs, 
which often have nonspe- 
cific membrane-active MoAs 
(22, 31, 32), as well as 
aptamers. Improvements 
in target-centric approaches 
can facilitate the discovery 
of compounds with spe- 
cific binding activity and 
lead to improved biological 
understanding, which can 
inform predictions of emer- 
gent properties such as 
drug interactions and syn- 
ergies. Of particular rele- 
vance to antibiotic resistance, 
a better understanding of 
how compounds interact 
with membranes is crucial 
for discovering drugs that 
are active against Gram- 
negative bacteria, whose 
outer membranes have 
proven particularly diffi- 
cult to penetrate (37). 
Drug development is a 
lengthy and intricate pro- 
cess influenced by numer- 
ous factors such as safety, 
cost, manufacturing, and 
clinical trial outcomes. For 
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anti-infective drugs in particular, toxicity to 
host cells is a common liability. Drugs can be 
toxic in different ways (e.g., cytotoxic, hemolytic, 
and genotoxic), and ML models predicting 
toxicity have been limited by factors such as 
the lack of high-quality datasets (38). Absorp- 
tion, distribution, metabolism, and excretion 
(ADME) properties, including chemical insta- 
bility in solution and metabolic breakdown, 
are also needed to filter out drug candidates 
that are nonselective or unsuitable for medici- 
nal use. Although high-throughput screens have 
focused on in vitro testing, there is substantial 
unmet need for anti-infective drugs that are 
effective against systemic infections. Predict- 
ing efficacy in animal models of acute systemic 
infections is a challenging task that has not 
yet been addressed by ML-driven approaches. 

We anticipate that active areas to watch are 
those that combine experimental and compu- 
tational approaches to address model predic- 
tive power and data scarcity. ML approaches 
that incorporate information from scarce train- 
ing data, as well as more-extensive search spaces, 
are likely to substantively augment anti-infective 
drug discovery. To guide experimental meth- 
ods to augment search spaces, generative ML 
models will continue to propose chemical struc- 
tures and peptide sequences de novo that can 
be synthesized and evaluated. Generative plat- 
forms such as GPT-4 and NVIDIA’s BioNeMo 
can also facilitate drug discovery by integrat- 
ing disparate streams of scientific information 
to improve our understanding of the underlying 
biology and chemistry. Interpretable or explain- 
able ML approaches (e.g., for graph neural net- 
works) can offer powerful ways of inferring 
salient structural features or improving model 
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learning from data. Computational pipelines 
that leverage structural predictions of proteins 
and other macromolecules provide a comple- 
mentary way to improve model predictive power. 
Detailed molecular dynamics simulations and 
ML-augmented approaches to docking exem- 
plify techniques that can better predict inter- 
actions between drugs and macromolecules 
(36, 39). We anticipate that sequence-to-structure 
models, such as AlphaFold for proteins or 
FARFAR2 for RNAs (74, 40), will also be useful 
for structure-guided design. Such models can 
be used to tune therapeutic candidates to 
achieve specific structures, bridging struc- 
tural predictions with the productive augmen- 
tation of search spaces. 


Al for infection biology and 
infection-related contexts 


Bacterial, eukaryotic, and viral pathogens in- 
fect diverse hosts and trigger complex host 
responses. Pathogen load, host immunity, 
treatments administered, and other factors 
influence the course of infection. Supervised 
ML models have been used to analyze struc- 
tured and unstructured nucleic acid, protein, 
glycan, and cellular phenotypic datasets to 
identify critical features and molecular net- 
works involved in host-pathogen interactions 
and immune responses (Fig. 2) (41-45). Vari- 
ous supervised and unsupervised ML models, 
including random forest classifiers and com- 
plex language models (models designed to 
understand or generate text), have been ap- 
plied to identify genes and protein-protein 
interactions associated with host cell changes, 
predict immunogenicity, and evaluate patho- 
gen killing, host cell adaptation, and virulence. 


Additionally, supervised models have been 
used to guide the development of vaccines 
and therapeutic drugs through the optimiza- 
tion of gene expression and antigen prediction 
and selection (46, 47). Reverse vaccinology, 
which bases antigen prediction on immuno- 
logic and genomic information, has been 
facilitated by supervised ML approaches, in- 
cluding Vaxign-ML (47). 

In general, ML has made an outsized con- 
tribution to analyzing large and often con- 
voluted datasets in infectious diseases research. 
Although these examples illustrate the prom- 
ise of using ML to elucidate key factors under- 
lying infections and how infections progress 
within hosts, understanding host-pathogen 
interactions and immune responses remains 
a challenging biological problem. This prob- 
lem can be addressed by integrating high- 
throughput datasets—including sequencing, 
structural, and microscopy data—with detailed 
mechanistic studies, experimentation, and in- 
fection models. Mechanistic and experimental 
studies, however, are typically low-throughput, 
constraining the generalizability of Al-guided 
approaches that rely on them. Experiments in 
which large datasets are systematically ac- 
quired and analyzed across different infection 
contexts, for instance through comprehensive 
CRISPR screens, RNA sequencing, and mass 
spectrometry, would foster the development 
of AI models that extend beyond tools for 
data analysis and make generalizable hypothe- 
ses and inferences. Parameterizing these ef- 
forts with biological sequences or chemical 
structures, such as small molecules, guide 
RNAs, or amino acid sequences, would offer 
tunable approaches to investigating infection 
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biology. In one example of a sequence-guided 
approach, a recent study developed unsuper- 
vised language models of influenza, HIV-1, 
and severe acute respiratory syndrome corona- 
virus 2 viral proteins based on amino acid 
sequence information and accurately pre- 
dicted escape patterns that allow these patho- 
gens to evade the human immune system (45). 
ML models that can make specific assump- 
tions about biology, such as the relevance of 
syntax (grammar) and semantics (meaning) in 
biological sequences, or leverage structural in- 
formation have the potential to guide the gen- 
eration of biological hypotheses and improve 
generalizability. 

Additionally, ML has productively processed 
microscopy datasets relevant to infection biol- 
ogy. Various forms of microscopy, including 
light and electron microscopy, have been used 
to generate datasets underlying ML models 
that detect bacteria, fungi, parasites, and 
viruses in host cells. These analyses have led to 
insights in host-pathogen biology, for instance 
by elucidating the developmental morpholo- 
gies of Plasmodium falciparum in human red 
blood cells using multicolor fluorescence mi- 
croscopy (48) and identifying virulence factors 
involved in Mycobacterium abscessus patho- 
genesis from high-content imaging and pheno- 
genomic data (49). 

Sequence-based ML approaches to messenger 
RNA and nucleic acid vaccines can accelerate 
design, and the turnaround times for the syn- 
thesis and experimental validation of these 
vaccines are short (50). Protein structure-based 
vaccine design (57) can also be augmented with 
computational predictions from AlphaFold or 
RoseTTAFold. Yet the use of ML for vaccine 
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development faces several challenges, including 
poor data quality, limited data availability and 
generalizability, and complicated testing proce- 
dures. Limited or only low-quality data may be 
available for certain populations or diseases, 
particularly for neglected tropical diseases, and 
these limitations can influence the choices of 
target antigens and constrain ML models that 
predict antigen presentation and vaccine tar- 
gets. Different infections have different host 
contexts, and ML models predicting the effi- 
cacy of vaccines, which modulate immunity in 
host cells, may be less generalizable to bio- 
logical contexts than those for anti-infective 
drugs. Furthermore, the validation of vaccine 


“Better leveraging of Al 
to address infectious 
diseases will require a 

collaborative effort among 
scientists, clinicians, 
and public health officials.” 


candidates can be time-consuming and expen- 
sive, requiring delivery to host cells and suitable 
immunogenicity assays. To begin addressing 
these challenges, comprehensive benchmark- 
ing datasets for antigen selection and vaccine 
efficacy will be needed. These datasets will 
help to standardize data quality and improve 
the predictive power of next-generation ML 
approaches to vaccine development. 

Mthas also informed clinical decision-making 
in infection contexts. A recent study used re- 


gression models to implement personalized 
antibiotic recommendations that minimized 
the risk of urinary tract and wound infec- 
tions (52). However, a general bottleneck in 
using ML to design treatment strategies is 
the need for data and models that are rele- 
vant to specific infection settings. An earlier 
study used support vector machines to ana- 
lyze bacterial gene expression patterns in hu- 
man patients, representing an important step 
toward showing that ML models can provide 
useful biological information relevant to clin- 
ical infections (53). Moving forward, multi- 
dimensional predictions of how anti-infective 
drugs and vaccines interact with model hosts 
and humans will help improve treatment strat- 
egies, anticipate adverse effects, and potentially 
increase success rates for new drugs in clin- 
ical trials. 

As new datasets and models are needed to 
improve the application of ML to infection- 
related contexts, we anticipate that future work 
will make biology more “embeddable”—that 
is, able to be represented by low-dimensional 
features, such as sequences, vectors, or graphs. 
Integrating ML with next-generation systems 
and synthetic biology methods for cellular pro- 
filing will drive progress in this area. For in- 
stance, combining high-throughput screens and 
microscopy with precise methods for biolog- 
ical control, such as gene editing or optoge- 
netics, would generate data relevant to key 
processes such as host cell stress responses, 
enabling manipulation of these pathways to 
address infectious diseases. In ML, promising 
types of language models include large lan- 
guage models, which are trained on large 
amounts of text data, and fine-tuned language 
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models, which are trained to perform a spe- 
cific task. Fine-tuned large language models for 
biology, such as BioBERT (54), may unify in- 
formation from diverse infection contexts and 
offer increased predictive power to help elu- 
cidate host-pathogen interactions, facilitate 
antigen selection, inform vaccine design, and 
design treatment strategies. 


Al for diagnostics and synthetic biology 


As large-scale testing efforts during the 
COVID-19 pandemic have illustrated, quick 
and accurate detection of infections and path- 
ogen outbreaks remains paramount to con- 
trolling the spread of infectious diseases. 
Recent advances in combining AI with syn- 
thetic biology, gene expression analyses, mass 
spectrometry, and imaging have substantively 
expanded our ability to detect infections and 
predict drug resistance (Fig. 3) (55-60). ML is 
well suited for catalyzing synthetic biology- 
based diagnostics because of the high program- 
mability of biological elements, the routine 
generation of large or sequence-based data- 
sets, and the ability of ML to extract meaning- 
ful information from biomolecular networks 
in disease biology (67). 

Engineering genetic elements and under- 
standing biomolecular networks remain criti- 
cal to designs that harness biology. Synthetic 
biology approaches leveraging enzymatic reac- 
tions, toehold switches (RNAs that respond to 
specific nucleic acid sequences), or CRISPR- 
Cas enzymes have been used for the detec- 
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tion of malaria, Ebola, Zika, COVID-19, and 
other diseases (62-67). Supervised ML models 
have facilitated the design of toehold switches 
(68, 69), CRISPR guide RNAs (70-72), and 
other biomolecules. Notably, large datasets are 
available for toehold switch function, CRISPR 
guide RNA activity, and other factors that are 
relevant to diagnostic design. Different types 
of neural networks, including feed-forward 
networks (neural networks with linear archi- 
tectures), convolutional neural networks (net- 
works composed of convolutional layers), and 
long short-term memory models, have been 
commonly used to model these data, but the 
same datasets can provide useful resources for 
testing more recently developed and potentially 
more predictive or generative ML models, 
including transformers or variational auto- 
encoders (Box 1), to more efficiently develop 
next-generation diagnostics. 

Beyond synthetic biology, ML has been used 
for gene expression-, mass spectrometry-, and 
imaging-based diagnostics. Gene expression- 
and mass spectrometry-based diagnostics have 
been applied to antimicrobial susceptibility 
testing (AST). AST remains important for in- 
forming the use of anti-infective drugs, but 
typical (culture-based) AST for bacteria, vi- 
ruses, fungi, and parasites can take at least 
several days to complete. This turnaround time 
remains too long to adequately address clin- 
ical needs for acute systemic infections, such 
as those resulting in sepsis. Recent studies 
have combined gene expression and interac- 


tion profiling, structural mutation-mapping, 
and ML to identify genetic signatures of re- 
sistance that could be used as the basis of 
rapid molecular diagnostics (56, 57). Super- 
vised ML classifiers have predicted antibiotic 
resistance profiles correlated with clinical 
matrix-assisted laser desorption/ionization- 
time-of-flight (MALDI-TOF) mass spectra of 
bacterial proteins, and these predictions could 
be completed within 24 hours of sample collec- 
tion (58). Nevertheless, a potential limitation to 
this approach is that the areas under the re- 
ceiver operating characteristic curve (AUROC) 
for different bacterial species were ~0.7, suggest- 
ing that improvements in classifier accuracy will 
be needed to make this approach useful (e.g., 
AUROC > 0.9) in clinical settings. ML has also 
informed more-traditional ways of diagnosing 
infections, including microscopy, epitope pro- 
filing (73), chest radiographs and CT scans 
(59, 60), and lateral flow tests (74). In each of 
these applications, the generation of large, multi- 
dimensional datasets combined with clear func- 
tional readouts, such as the presence or absence 
of aresistance profile or a disease, makes ML 
particularly useful for producing accurate 
predictions. 

Nevertheless, there remain important chal- 
lenges in applying ML to diagnosis, including 
low data quality or quantity for new or em- 
erging pathogens, the limited generalizability 
of the current data and approaches used, and 
the need for highly accurate diagnostic pre- 
dictions in clinical settings. Obtaining enough 
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high-quality data relevant to new or emerg- 
ing pathogens or strains, particularly in low- 
resource settings, remains a difficult problem 
that is exacerbated by a lack of scientific in- 
frastructure and variable public health re- 
sources. ML models based on limited data may 
exhibit biases, promulgating inappropriate 
diagnostics, misdiagnoses, and greater health 
inequalities that make it more difficult to 
serve patient populations. These biases may 
also remain undetected, especially when black 
box ML models, which do not provide any 
explanation or interpretation of their predic- 
tions, are used (30, 67). Even when high-quality 
sequencing data from large infectious disease 
databases, such the PATRIC database (75), are 
available, it remains to be seen whether anti- 
microbial resistance predictions based on these 
data are generalizable when applied to ge- 
netically diverse infections found worldwide. 
Furthermore, unlike for anti-infective drug 
discovery—where the stakes for false positives 
and false negatives predicted by ML models 
are lower because the predictions can be fur- 
ther tested—the consequences of an inaccurate 
diagnostic prediction can be severe. In fact, a 
recent survey suggested that no existing model 
for the diagnosis or prognosis of COVID-19 from 
chest radiographs and CT scans was of poten- 
tial clinical use owing to methodological flaws, 
biases, or both (60). Models with comparatively 
high AUROC values (i.e., 0.90) may still be too 
weak for clinical applications, as this value im- 
plies that, given a positive and a negative diag- 
nosis, the negative diagnosis is ranked higher 
than the positive diagnosis 10% of the time. 
Until more-accurate ML models can be devel- 
oped, Al-based diagnostics might play only a 
supporting role in clinical settings. 

Moving forward, we anticipate that researchers 
will focus on the ML-guided design and dis- 
covery of synthetic circuits enabling the devel- 
opment of low-cost and portable diagnostics, 
the application of AI to data generated from 
clinical and field-deployable diagnostics that 
improve accessibility and scope, and the devel- 
opment of ML models that provide accurate 
diagnoses in clinical settings. In particular, 
the application of sequence-to-function mod- 
els, language models, and generative models to 
RNA switches, CRISPR-based tools, and other 
programmable elements will be promising 
areas of growth given the ability for rapid 
iteration and the precise, on-target activity of 
these synthetic biology approaches (67-71). By 
increasing the testing and reporting of in- 
fections, the development of low-cost, field- 
deployable diagnostics should also help produce 
more-balanced datasets that better sample 
local infections and make ML models less biased. 
AI or ML models that can extract information 
from small or incomplete datasets, using tools 
such as transfer learning (which adapts models 
trained on a specific task to other tasks) and 
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Bayesian networks (networks that allow for 
probabilistic inference), can play outsized roles in 
how infectious diseases are addressed, especially 
for overlooked populations in low-resource areas. 
Such models could lead to more-personalized 
medicine, in which diagnoses or resistance 
profiles can be readily reported on the basis 
of data from only a few infections and help 
guide the use of anti-infective drugs. On the other 
hand, the accuracy of ML models also needs to 
improve for practical use in clinical diagnoses. 
Future ML models will likely need to be optimized 
in architecture, thoroughly evaluated for biases, 
and trained on large amounts of robust data to 
achieve high accuracy. Transfer and multitask 
learning, attention mechanisms, and other ap- 
proaches can help these next-generation ML 
models provide more-accurate diagnoses. 


Outlook 


Approaches combining systems and synthetic 
biology with ML models, including graph 
neural networks, sequence-to-function and 
sequence-to-structure frameworks, and gen- 
erative models, are yielding access to drug 
candidates and methods for drug discovery. 
Supervised classifiers, unsupervised language 
models, and other ML models have produced 
biologically relevant insights into how patho- 
gens interact with host cells and immune 
responses, informing antigen determination, 
vaccine design, and treatment strategies. The 
aforementioned types of ML models have also 
informed the design of various diagnostic 
tools and improved system accuracy, helping 
clinicians to diagnose infections and detect 
antimicrobial resistance. Beyond medical and 
biotechnological approaches to infectious dis- 
eases, ML—and AI more broadly—has also led 
to substantive advances in epidemiology and our 
understanding of disease transmission. Better 
leveraging of AI to address infectious diseases 
will require a collaborative effort among scien- 
tists, clinicians, and public health officials. 
Developing AI models that generalize and 
avoid bias will require the acquisition and in- 
tegration of comprehensive datasets. These 
datasets might include high-throughput ther- 
apeutic counter-screens and explorations of 
diverse chemical spaces for drug discovery, 
data from drug-target interactions and biomo- 
lecular interactions, and genetic sequencing 
information that is robustly and representa- 
tively sampled from all infections, including 
those occurring in low-resource or hard-to- 
access areas. Programmable modalities, such 
as nucleic acid and amino acid sequences, 
have represented tractable and common start- 
ing points for ML models (such as those pre- 
dicting structure from sequence), but advances 
in biology and chemistry are important to 
opening up search spaces and making biology 
more “embeddable,” or able to be represented 
by low-dimensional features. Progress in this 


area will help to predict therapeutic efficacy and 
drug MoAs, complex host-pathogen interactions 
and host responses, and interactions between 
small molecules, proteins, peptides, and nucleic 
acids. Advances in AI will include approaches, 
such as few-shot and multitask models, that 
leverage more of the available scientific infor- 
mation for dealing with limited or low-quality 
data. Furthermore, interpretable, explainable, 
and generative ML approaches will lead to 
specific biological hypotheses and insights. We 
anticipate that AI will continue to empower us 
to design next-generation drugs, vaccines, and 
diagnostics that address infectious diseases. 
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The central role of density functional 


theory in the Al age 
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Density functional theory (DFT) plays a pivotal role in chemical and materials science because of its 
relatively high predictive power, applicability, versatility, and computational efficiency. We review 
recent progress in machine learning (ML) model developments, which have relied heavily on DFT for 
synthetic data generation and for the design of model architectures. The general relevance of these 
developments is placed in a broader context for chemical and materials sciences. DFT-based ML models have 
reached high efficiency, accuracy, scalability, and transferability and pave the way to the routine use of 
successful experimental planning software within self-driving laboratories. 


t is hardly controversial that we live in an 

era of artificial intelligence (AI). AI has 

touched upon and affected almost every 

branch of human activity, assuming center 

stage in many domains of daily life, such 
as natural language processing, computer vision, 
and forecasting. An AI-based robot scientist 
named “Adam” to assist and accelerate scien- 
tific discovery was introduced to the synthe- 
tic biology field nearly 20 years ago (7). Promising 
first steps with regard to robotic and autonomous 
experimentation for chemistry and materials, 
however (2, 3), such as delivering self-driving 
laboratories for thin film discoveries (4), have 
only been made more recently. Nevertheless, 
such groundbreaking progress has already led 
Krenn et¢ al. to survey community members 
and to fundamentally reconsider the meaning 
of “understanding” in the context of the scien- 
tific process itself (5). As outlined previously by 
Aspuru-Guzik et al. (6), the success of autono- 
mous self-driving laboratories in chemistry 
and materials will depend crucially on the 
availability of machine learning (ML)-based 
control software capable of reliably forecasting 
and ranking experimental outcomes through- 
out the materials and chemical compound 
space (CCS) with sufficient accuracy in real 
time (7). CCS refers to the tremendously large 
set that emerges for all conceivable combina- 
tions of chemical composition, structures, and 
experimental conditions that result in stable 
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forms of matter (8). Thermodynamic and kinetic 
stability being well defined through the quan- 
tum statistical mechanics of electrons and 
nuclei, reliance on a quantum mechanics-based 
approach toward CCS is an obvious way forward. 
Unfortunately, the relevant equations of quan- 
tum and statistical mechanics can only be solved 
exactly for the simplest of systems, rendering 
numerical solutions of approximate expres- 
sions necessary. Method development for the 
physics-based computational design and dis- 
covery of materials and molecules in CCS rep- 
resents a long-standing challenge and has 
motivated decades of atomistic simulation re- 
search (9, 10) Applications are as diverse as the 
atomistic sciences and include improved solu- 
tions for batteries, transistors, catalysts, coatings, 
ligands, alloys, and photovoltaics, among others. 
All such efforts have in common that they at- 
tempt to virtually navigate CCS to narrow down 
the search space for subsequent experimental 
verification and characterization. 

The importance of electronic structure infor- 
mation for computational materials identifica- 
tion, characterization, and optimization has 
recently been highlighted by Marzari et al. 
(11). The most powerful compromise between 
predictive power and computational burden 
for calculating the properties and behavior 
of gaseous and condensed systems from first 
principles is density functional theory (DFT). 
In particular, the effective single-particle flavor 
of DFT, approximating the electronic kinetic 
energy contribution within the Kohn-Sham 
(KS) framework (12), has proven immensely 
useful. A countless number of ever-improving 
approximations to the exact exchange-correlation 
potential has led to further progress (13). With 
one of the cofounders of DFT, Walter Kohn, 
having been awarded the Nobel Prize in Chem- 
istry in 1998, expectations for further improve- 
ments ran high at the turn of the century (14) 
and during the widespread adaption of ab initio 
molecular dynamics and time-dependent DFT 
to also treat thermal effects and electronic 
excitations, respectively. Consequently and 
unsurprisingly, two DFT-related contributions 


q 


featured among the top 10 papers of all ti ie 


as highlighted in Nature in 2014 (15). Atup-2 


contemporary reviews have described further 
improvements (J6), highlighted the impor- 
tance of numerical reproducibility (77), or em- 
phasized the importance of electron density as 
a measure of quality in addition to energy (13) 
(Fig. 1D). Among all of the relevant energy modes 
that affect the statistical mechanics of matter 
through their partition function (as manifested, 
for example, by translational, rotational, or vi- 
brational degrees of freedom), calculating the 
electronic structure contribution represents the 
largest computational bottleneck, even when 
using DFT rather than more traditional post- 
Hartree Fock (post-HF) methods. In fact, the exe- 
cution of DFT-based computational chemistry 
and materials protocols consumes major frac- 
tions of publicly available high-performance 
compute allocations (/8) and has still largely 
failed to become a modern standard of indus- 
try 4.0 (“digital twins”). This situation is aggra- 
vated by CCS’s steep combinatorial scaling and 
by neglect of correlations, i.e., relying on DFT- 
based compute campaigns that treat every 
system independently. Encouragingly, relying 
heavily on DFT, physics-based supervised quan- 
tum machine learning (QML) approaches have 
been introduced in recent years. This is not to 
be confused with the alternative use of QML 
when referring to the examination of quantum 
computing’s potential to accelerate ML. Instead, 
we refer to QML specifically as classical ML of 
quantum observables regressed throughout chem- 
ical compound space. This use is well aligned 
with the convention of prefixing “quantum” 
within atomistic simulation, such as quantum 
Monte Carlo (QMO), quantum molecular dynam- 
ics, or even quantum chemistry. The encouraging, 
if not overwhelming, progress of physics-based 
QML in this domain originates, in our opinion, 
in its universal capability to generalize to such a 
degree that quantum observables can reliably 
be inferred for out-of-sample compounds across 
CCS as long as sufficient and representative 
data were provided for training. 

It is difficult to overstate the general impor- 
tance of these developments. In particular, as 
also argued previously (19), the emergence of 
generalizing statistical surrogate models (ML) 
indicates the formation of a fourth pillar in the 
hard sciences. This notion is universally ap- 
plicable, going even beyond just the chemical 
and materials sciences. More specifically, the 
first, second, third, and fourth pillars correspond 
to manual experimentation, the theoretical 
framework to explain and predict experimen- 
tal observables, numerical simulation tools for 
computationally complex equations of the the- 
oretical framework, and statistical learning 
approaches that exploit relations encoded in 
experimental or simulated training data to infer 
observables, respectively. These pillars clearly 
build onto each other, and DFT can be seen as 
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bridging and encompassing them, all the way 
from the experimentally observable electron 
probability distribution through the Hohenberg- 
Kohn theorem (demonstrating the uniqueness 
of the electron density) and the Kohn-Sham 
ansatz (addressing the problem of the kinetic 
electronic energy) through the many numerical 
implementations and hardware use cases to 
training data generation and for informing 
the design of data-efficient ML model archi- 
tectures. Figure 2 illustrates some of the pos- 
sible ways that have already been explored to 
make use of DFT within ML models. To sub- 
stantiate our view toward the key role that 
DFT plays in the fourth pillar of science, we 
will now review many of the specific ML con- 
tributions that have strongly benefited from it. 
To this end, this review is structured according 
to four categories, efficiency, accuracy, scalabil- 
ity, and transferability (EAST) (Fig. 2). EAST 
components represent an intuitive ordering 
principle that allows us to meaningfully dis- 
cuss, distinguish, and compare some of the most 
important features necessary for building and 
using digital twins within the chemical and 
materials sciences. 


Efficiency 


Compared with DFT (or higher-level quan- 
tum chemistry), one of the most notable fea- 
tures of QML models is their unparalleled 
prediction speed after training. Although both 
approaches start from the same information 
entering the electronic Hamiltonian in the 
form of the external potential (i.e., chemical 
composition and geometry), ML model pre- 
dictions are statistical surrogate model eval- 
uations that amount to simple and efficient 
linear algebra operations, typically multiple 
orders of magnitude faster than DFT. By con- 
trast, conventional physics-based simulators 
such as quantum calculations involve the com- 
putation of electronic integrals and iterative 
solvers of differential equations (typically diag- 
onalization), both of which are high-dimensional, 
nonlinear, and computationally more demand- 
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ing. However, to fully assess the efficiency of 
ML models, the associated cost for both, com- 
putational load for testing and training, and 
data acquisition has to be accounted for. With- 
in the context of CCS, data are typically scarce, 
and data acquisition plays a crucial role. Train- 
ing data needs are typically quantified in the 
form of learning curves (Fig. 1A), measuring 
the amount of reference data such as those 
coming from DFT calculations, which are re- 
quired to reach a certain prediction error for 
out-of-sample queries, i.e., compounds that 
have not been part of training. Given sufficient 
data, the former point on model complexity 
becomes the numerical bottleneck. ML mod- 
el complexity is roughly proportional to the 
number of parameters used for training and 
testing of models. Contrary to parametric neu- 
ral network or random forest models, non- 
parametric ML models such as Gaussian process 
regression become less efficient as the number 
of training compounds increases. Furthermore, 
increasingly complex representations can reduce 
model efficiency. Considerable contemporary 
ML research is devoted to maximizing numer- 
ical efficiency through hardware (graphics pro- 
cessing units), as well as model architecture 
(optimizers and representations) (20). 
Typically, there is a trade-off between a ML 
model’s efficiency and its predictive power. For 
instance, ML models trained on fewer data can 
be more efficient but less accurate than those 
trained on more data. Sampling training data 
in representative ways still constitutes a sub- 
stantial challenge when trying to reach univer- 
sal accuracy already for small fractions of CCS 
(21). To better deal with this trade-off between 
efficiency and capability to generalize, one can 
rely on active learning schemes to sample train- 
ing data more effectively than through random 
selection. Active learning (Fig. 3B) attempts to 
emulate the optimization of training set selec- 
tion (22), for example, by biasing the training 
selection using query feature similarity (23, 24). 
One can view this as an attempt to deal with 
the scaling of compound space that is closely 
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related to the idea of “on-the-fly” learning (25). 
As a result, learning curves are steepened and 
data efficiency is improved. 

Alternatively, one can make use of various 
types of A-ML (26) (refined in Fig. 2) to en- 
hance predictive accuracy by learning correc- 
tions to labels, rather than absolute labels, 
thus exploiting accurately reflected trends in 
lower-level methods as well as error cancella- 
tion. For example, A-ML between DFT and 
coupled cluster singles, doubles, and perturb- 
ative triples [CCSD(T), a very accurate post-HF 
method] has been shown to yield corrected ab 
initio molecular dynamics trajectories with 
improved accuracy (27). Similarly, within trans- 
fer learning, optimized neural network weights 
of the first few layers trained on low-fidelity 
data (e.g., DFT) can be transferred to the model 
trained on high-fidelity data (e.g., CCSD) (28). 
Although effective in practice, transfer learning 
may suffer from low explainability. 

Furthermore, going beyond mere A-ML (26, 29) 
and transfer learning (28), correlations between 
multiple quantum approximations can be ex- 
ploited through use of multilevel grid combi- 
nation QML (CQML) approaches (30, 37). CQML 
is analogous to composite quantum chemistry 
(often DFT based), or Jacob’s ladder within 
DFT (Fig. 4), and allows for systematic error 
cancellations in hierarchical datasets of basis- 
set and electron correlation dimensions (Fig. 3B). 
For example, few high-fidelity data (e.g., double- 
hybrid DFT) for small systems, some medium- 
fidelity data (e.g., GGA) for medium-sized systems, 
and many low-fidelity data [e.g., local density 
approximation (LDA)] for large systems can 
be combined to yield high-fidelity quality pre- 
dictions (Fig. 4). The CQML model in (30) 
combines three different dimensions, electron 
correlation, basis set, and training molecules. 
Within the realm of DFT, unifying amons (24) 
(ie., small molecular fragments made up of typ- 
ically no more than seven nonhydrogen atoms 
obtained through systematic fragmentation) and 
hierarchical density functionals, as demon- 
strated for the widely-known Jacob’s ladder 


curves 


Cost (log) 


Fig. 1. Challenges for ML. (A) Prediction errors decay with inverse power laws in 
the limit of large training sets (33), whereas performance curves, i.e., related metrics 
not explicitly included in loss function, do not necessarily improve arbitrarily. 

(B) Test errors are a combination of a bias from the flexibility of the model to fit the 
data points and the variance arising from the model's flexibility between data points. 
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(C) Integrated DFT-based electron density differences, a quantitative measure for 
density accuracy, of popular density functionals averaged over a random subset of 
QMrxn20 reactant molecules (72). (D) DFT errors of energy have improved over 
time yet yield lower accuracy for corresponding electron densities [average median- 
normalized error from literature (13)]. 
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Fig. 2. Conceptual overview. (A) To review the key role of DFT, we rely on 
four key categories of predictive ML models of chemical and materials 

properties and processes, EAST: E (efficiency), A (accuracy), S (scalability), 
and T (transferability). A model of EAST generalizes well to unseen systems. 


(Fig. 4), one can also greatly improve model effi- 
ciency with a set of extra, low-cost calculations 
[e.g., LDA]. Therefore, mixing DFT with other 
high-level electron correlation models such as 
CCSD(T) or QMC within the framework of CQML 
is a very promising strategy (29), particularly for 
the construction of large-scale datasets of po- 
tential energy surfaces. In addition to efforts to 
make DFT property-driven ML more efficient, 
another promising direction for improved cost 
reduction is to use ML to help improve orbital- 
free DFT, ie., removing the explicit depen- 
dence on orbitals for the kinetic energy term 
in KS-DFT by ML the kinetic energy density 
functional (32) directly from data. 

Highly efficient ML models enable us to mini- 
mize data needs and model complexity. Bene- 
fiting in such a way from adherence to Occam’s 
razor, the resulting ML models will enable rapid 
iterations throughout CCS, virtually as well as 
in the real world through autonomous robotic 
execution and evaluation of experiments. 


Accuracy 


The underlying promise of all ML is that 
training on more reference data will result in 
more accurate models. Barring overfitting and 
inherent limitations due to extrapolation, any 
sufficiently flexible regressor converges the 
prediction error for smooth label functions down 
to noise level in the limit of infinite training 
data, which is consistent with the standard 
deviation of the error distribution decaying 
with the inverse square root of sample number 
for any fixed domain (central limit theorem). 
The leading term of prediction error typically 
decays according to an inverse power law for 
kernel methods as well as neural networks 
(33, 34), and on log-log scales, learning curves 
(Fig. 1A) must therefore exhibit linearly decay- 
ing prediction errors as a function of increas- 
ing training set sizes. Consequently, learning 
curves play a crucial role for the comparative 
assessment of ML models. The systematic im- 
provement of a model is only guaranteed for 
the learned label alone, whereas derived prop- 
erties might improve coincidentally, yielding 
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performance curves (Fig. 1A). This bears some 
analogy to DFT functionals, in which improve- 
ments in accuracy with respect to energies does 
not necessarily imply improvement of elec- 
tron densities (73) (Fig. 1D). Note the importance 
of sufficient flexibility and converged cross- 
validation to achieve systematic results with- 
out overfitting (Fig. 3B). At constant training 
set size, ML models become more accurate 
when hard requirements and boundary con- 
ditions are accounted for, e.g., when including 
three- and four-body interactions (35) or by re- 
ducing delocalization errors (36). The repre- 
sentation (feature set) that either serves as the 
ML model input or is learned by the ML model 
itself is crucial. As mentioned and reviewed on 
multiple occasions (7, 37), training data-efficient 
representations should be unique or complete 
(in the sense of a bijective, one-to-one relation- 
ship to the external potential), compact (small 
size of feature vector), and sufficiently expres- 
sive to account for underlying physics such as 
power-law relations, symmetries, invariances, 
or constraints. A major outstanding challenge is 
quantifying uncertainty (38), especially because 
residual errors for common tasks are distinctly 
non-normal (39). Similar ideas have been suc- 
cessful (e.g., Jacob’s ladder; Fig. 4) within DFT 
with LDA, PBE, or SCAN (strongly constrained 
and appropriately normed) satisfying known 
constraints for DFT functionals (40), which 
has helped the accuracy of DFT despite its use 
of uncontrolled approximations, with only few 
bounds on the discretization error available (41). 

Most DFT-based ML models introduced so 
far roughly fall into any one of the following 
three categories (Fig. 2). 


Learning of electronic observables 


Learning of electronic observables, i.e., quantum 
mechanical averages for which the electronic 
details of density or wavefunctions have been 
integrated out (circumvented/replaced in Fig. 2), 
relies on input based on the same information 
as for the model Hamiltonian: atomic coordi- 
nates, nuclear charges, number of electrons, and 
multiplicity. Once trained, direct QML (predict- 


Assist 
(B) ML approaches going beyond DFT. Black indicates the traditional workflow 
in KS or orbital-free (OF) DFT in which a molecule defines the Hamiltonian, 
which yields the KS orbitals 6, (or electron density, p = X6,|2) and eigenvalues ¢;. 
Colors indicate ML-based routes taken to build upon DFT. 


Augment 


Kinetic energy functional 


ing atomic forces, molecular energies, or other 
quantum properties) is analogous to black-box 
usage of DFT and is the most common method 
used so far. Because chemical compound space 


formally grows combinatorically with the 4NV : 


dimensions manifested by chemical elements 
and atomic positions, the main drawback of di- 
rect learning is that it will generally fall short 
of the desired accuracy because of the exorbi- 
tant training data needs. For example, despite 
years of active research, the QM9 challenge to 
reach chemical accuracy when estimating atom- 
ization energies for simple organic molecules 
in gas phase (42) after training on just 100 
examples has not yet been solved. 


Recently introduced hybrid ML/DFT approaches 


Recently introduced hybrid ML/DFT approaches 
improve the DFT model construction by learn- 
ing the effective Hamiltonian as an intermediate 
quantity (replaced in Fig. 2), from which target 
properties follow straightforwardly, e.g., SchNorb 
(43) or DeepH (44). In addition to improving ac- 
curacy, this hybrid approach can offer better ac- 
curacy for intensive properties such as the energy 
gap between the HOMO (highest occupied mo- 
lecular orbital) and LUMO (owest unoccupied 
molecular orbital). Similarly, ML strategies have 
improved semiempirical quantum chemistry (45) 
and tight-binding DFT (46, 47). 


ML density functionals 


ML density functionals (Assist in Fig. 2) come 
in two variants. The first is within the KS-DFT 
framework and improves mapping from the 
electron density to the exchange-correlation 
energy based on higher-level reference data 
[e.g., DFT/CCSD density and CCSD(T) energy]. 
With this method, it is possible to approximate 
the exact unknown exchange-correlation func- 
tional with high accuracy. The main drawback, 
however, is that the computational cost asso- 
ciated with the explicit dependence on orbitals 
in the kinetic energy and exchange term is not 
removed. Well-known contributions relying on 
this method include NeuralXC (48) and DM21 
(36). The second strategy is more in the spirit 
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of Hohenberg-Kohn, building the orbital-free 
map from electron density to energy (augmented 
in Fig. 2). First shown for the kinetic energy 
density functional (49), this line of research also 
includes the map from potential to density to 
energy (50, 51). 

Numerical approximations in DFT are rooted 
in careful neglect of certain physical effects. To 
reach experimental accuracy, ML models might 
eventually even require the inclusion of ex- 
perimental observables to guarantee improve- 
ment over synthetic computational approaches, 
e.g., by automated and data-driven approaches 
(52). Given the strategies outlined, we believe 
that ML on DFT is reaching a point where 
reliable forecasting of materials’ properties 
is possible, but autonomous experimental ex- 
ploration will be necessary to further improve 
accuracy and applicability. 


Scalability 


Scalability is critical for enabling the study of 
larger and more complex electronic systems. 
Although linear scaling DFT-based implemen- 
tations such as ONETEP have made great pro- 
gress (53), DFT generally scales cubically with 
system size, which is considerably more favor- 
able than accurate post-HF methods such as 
CCSD(T) [~O(N’)]. Nevertheless, the routine 
use of ab initio molecular dynamics simula- 
tions using the most accurate flavors of DFT, 
hybrid or range-separated DFT in particular, 
becomes rapidly elusive for larger systems, for 
example, in small proteins such as ubiquitin 
(Fig. 4). ML can also help in this regard, most 
commonly by partitioning models of extensive 
properties onto atomic contributions. 

The scalability of ML models can also be 
assessed in a more chemical sense, namely its 
ability to generalize to larger query systems 
after training on smaller systems only. This 
rests upon the locality assumption that is im- 
plied when using similarity measurements 
based on atomic environments. This assump- 
tion is often justified with reference to Kohn’s 


A B 
Error (log) Levels of theory 
‘e. Classical force field / 
‘e, Semi-empirical methods High 
‘e Hartree-Fock Medium 
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Fig. 3. Budget-aware computing strategies for sampling chemical com- 
pound space. (A) Reference data acquisition cost accuracy Pareto front with 
popular atomistic simulation approaches. At mean-field cost, DFT stands out for 
approaching the accuracy of explicit electron correlation methods. (B) Overview 
of selection strategies for increased data efficiency when training models are 
capable of sampling chemical compound space. Some include training data 
(points) from different levels of theory (colored) resulting in models to be 
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nearsightedness of electronic matter (54, 55). 
For example, as pointed out by Kohn and others 
(66, 57), the locality of the one-particle electron 
density matrix of a system with periodic bound- 
ary condition decays exponentially with the 
HOMO/LUMO gap. Nearsightedness has a 
series of consequences, such as the extent of 
charge transfer (which has bigger impact on 
charged species), conjugation, electron cor- 
relation (in particular for strongly correlated 
systems), and London dispersion (in large bio- 
molecules as well as noncovalently interacting 
molecular complexes) (Fig. 2). 

In practice, a scalable ML model has the 
necessary but not sufficient requirement that it 
coarse-grains the representation such that neg- 
ligible long-range interactions can be omitted. 
Improving scalability ranks perhaps among the 
most common and severe challenges of state- 
of-the-art ML models, possibly because of the 
difficulty of accurately including long-range 
interactions. Among the many long-range ef- 
ects, interacting Coulombic multipole moments 
and their polarizabilities have been considered 
first within multiscale ML models (58) because 
of their well-known classical structure and 
the availability of many empirical models. As 
for other long-range effects of quantum mech- 
anical origin, such as conjugation, spin-orbit 
coupling, surface crossing, and electron cor- 
relation in general, robust scalable ML-based 
models are yet to be devised. 

Once the short- and long-range effects 
accessible through DFT are properly accounted 
for by scalable ML models, the routine study of 
condensed systems, macromolecules, defects, 
and maybe even grain boundaries will become 
affordable to the entire community. 


Transferability 


The physical origins of chemical transferabil- 
ity of atoms, bonds, or functional groups have 
been much studied and are closely related to 
the aforementioned locality and nearsighted- 
ness of electronic matter (55). Here, we associate 


transferability predominantly with the capabil- 
ity to generalize short-range effects across CCS. 
Long-range effects are often less subtle and 
complex by comparison and matter more in the 
context of scalability, as discussed before. Ex- 
amples for short-range effects include well- 
known assignment of interatomic many-body 
contributions in terms of force-field topologies 
such as the degrees of freedom associated with 
covalent bonding, angles, or torsion, and steric 
hindrance. As illustrated by the amon concept 
(24) (Fig. 4) for various labels calculated with 
DFT, these projections of internal degrees of 
freedom, together with their off-equilibrium 
distortions for every conformer, can be conve- 
niently represented throughout chemical space 
by molecular fragments with systematically in- 
creasing size. Other effects, dominated by long- 
range characteristics, can be left for approximate 
treatments with empirical or other relevant mod- 


els (59). As a recent example of the insight gained _ 


from such a transferable approach, as imple- 
mented by training on both local small bottom- 
up and large top-down fragments, we refer to the 
stark differences observed between smooth and 
accurate DFT-trained ML trajectories of the pro- 
tein crambin in aqueous solution and classical 
force field-based Brownian motion-like counter- 
parts with stochastic characteristics (60). 

ML practitioners not familiar with DFT might 
not be aware of the fact that different quantum 
properties exhibit strongly varying degrees of 
transferability depending on their exact defini- 
tion. The transferability of energy, for instance, 
depends on whether it is the total energy (derived 
from a particular theoretical level), an orbital’s 
eigenvalue, an atom’s energy within a molecule, 
the electron correlation energy, reorganization 
or relaxation energies, or the relative energy be- 
tween two theoretical levels. For example, the 
latter point was shown recently, indicating that 
ML models of molecular HF energies require 
more training data than energies calculated with 
methods that include electron correlation such 
as DFT or QMC, possibly because, on average, 
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queried for new compounds (gray lines). Direct learning refers to all data 
points from one level, delta and transfer learning approximate differences 
between levels on aligned and disjoint training systems, respectively. Multilevel 
learning exploits hierarchical data structures across multiple levels. Selective 
learning trains on data points (filled) nearest to queries, ignoring all others 
(empty). Active learning biases the data collection (empty) to improve overall 
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Fig. 4. Jacob’s ladder (left) refers to the hierarchy of increasingly less 
approximate parameterizations commonly made to model the exact 
unknown exchange-correlation potential. Used within CQML and in combination 
with hierarchies in fragment space, e.g., amons (right) (24), properties of large 
query targets (e.g., the small protein ubiquitin shown as an inset) can be predicted. 


correlation tends to smoothen the energy func- 
tion by contraction of atomic radii (29). Con- 
versely, consider the increase in transferability 
(and consequently reduction in data needs) for 
absolute energies versus energy differences be- 
tween two levels of theory as exemplified for 
DFT and a higher-level post-HF method for 
A-ML that was demonstrated in 2015 (26). 

Although early studies mostly focused on 
proof-of-principle and atomic and interatomic 
energetics throughout CCS, more recent research 
indicates that properties at the finer electronic 
resolution can be transferable as well. For 
instance, transferable ML models of atomic 
nuclear magnetic resonance shifts calculated 
with DFT were published in 2015 (67). Further, 
the electron density of small molecules such as 
ethene or butadiene was found to be transfer- 
able to octatetraene (62). Learning the defor- 
mation density (63) can also help to improve 
the transferability of electron density. Further 
evidence of transferability at electronic reso- 
lution includes the interelectronic force field 
approach (64) and the localized molecular 
orbital feature-based approach (65). Superior 
model-transfer ability can also arise from the 
use of electronic features such as Mulliken 
charges and bond order (66). 

Another important and fundamental con- 
cept at the electronic level, the density functional 
(DF), has been deemed transferable. Approxi- 
mated DFs constructed from (biased) heuristics, 
however, are prone to severe transferability 
issues. Such shortcomings, also described by 
Nagai et al. (67), could be alleviated by an in- 
depth analysis of prediction error distributions 
(39), which would lead to a more systematic 
generation of DFs using data-driven ML ap- 
proaches (assisted in Fig. 2) (49). By incorpor- 
ating additional physical constraints into ML, 
one can enhance the transferability of DFs. This 
has also been shown more recently by learning 
the nonlocal exchange DF with a tailored repre- 
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sentation that preserves the density distribu- 
tion under uniform scaling (68). Beyond DFT, 
physical constraints (such as Kato’s cusp condi- 
tions) are also crucial in the correlated framework, 
because they help to enhance the expressiveness 
of the deep neural network for wavefunction 
approximation (69). 

High transferability across CCS represents the 
ultimate test for DFT as well as ML. Encourag- 
ing progress has been made so far, indicating the 
possibility of sampling CCS more freely and con- 
currently paving the way toward software con- 
trol solutions that will routinely be able to handle 
even exotic chemistries and formulations, e.g., 
within the self-driving laboratory setting. 


Conclusions 


We have reviewed the instrumental role that 
DFT has played in the emergence of QML-based 
models that enable the navigation of chemical 
compound space with EAST. In addition to its 
fundamental theoretical underpinnings in terms 
of a quantum mechanical approximation meth- 
od, DFT has served as a truly outstanding source 
of calculated properties for freely chosen mol- 
ecules or materials with controllable and rea- 
sonable acquisition costs and most welcome 
accuracy. Outstanding electronic structure chal- 
lenges to DFT-based QML include surface cross- 
ing, open-shell and spin-orbit coupling effects, 
conductivity, and excited states dynamics. How- 
ever, DFT can already be extremely useful as an 
intermediate-quality method that can be ex- 
ploited in multifidelity QML models or for in- 
spiration of physics-based QML architectures. 
The availability of large, diverse, and high- 
accuracy (at the experimental level or above) 
materials and molecular property datasets will 
remain a fundamental requirement for the 
development and training of transferable 
QML models that can universally handle any 
property and chemistry and that can conve- 
niently be incorporated within the experi- 


For each pair of amon subset (ranked by system size) and density functional 
approximation (ranked by Jacob's ladder), the computational burden for training 
data generation is correlated. First demonstrated for organic chemistry amons 
and GGAs, hybrid functionals, and QMC in (29) HFX, HF exchange; MP2, Maller 


mental planning software of future self-driving ° 


and closed-loop autonomous experimentation. 
We consider the general scarcity and lack of 
data to represent one of the most severe current 
roadblocks for our community's path toward 
that goal. For example, many available databases 
report only select properties and molecular 
graphs for Lewis structure-conforming sys- 
tems. Notable exceptions, such as DFT-based 
distortions along normal modes and conform- 
ers reported in QM7-X (70), are few and scat- 
tered throughout the chemical compound 
space. Representative data encoding transition 
states, defects, charged species, radicals, en- 
tire ab initio molecular dynamics trajectories, 
d- and felements, or excited states, to name 
just some, are still mostly lacking. 

Although notable achievements have been 
made in successfully applying physics-based 
ML to DFT solutions throughout chemical and 
materials sciences, there remains a dearth of 
theoretical research focused on the underlying 
fundamentals. Some of the basic open questions 
include: (i) Can one rigorously define CCS in a 
mathematical way, akin to the Hilbert space for 
electronic wavefunctions, to quantify its inher- 
ent properties such as dimensionality, density, 
and volume? (ii) CCS is discrete in reality, yet 
which maps enable smooth interpolations into 
latent spaces that facilitate inverse design (71)? 
ii) Can there be a single QML model that allows 
for a unified yet accurate description of any 
chemical compound, regardless of its size, com- 
position, aggregation state, and external con- 
ditions? In summary and roughly speaking, 
DFT has affected QML as an ab initio solver, 
with ever-improving performance of ML-based 
exchange-correlation and/or kinetic energy 
density functionals, as a hybrid DFT/ML frame- 
work for building effective Hamiltonians, and 
as a robust computational workhorse for gen- 
erating highly relevant and affordable synthetic 
data. Reflecting on the four EAST categories, 
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efficiency, accuracy, scalability, and transfer- 
ability, it is clear that DFT has played a pivotal 
role in bridging all of the pillars of modern 
science, from experiments through theory and 
simulation to physics-based ML model build- 
ing. Seeing the impressive progress made by 
building onto DFT, we do not think that it is 
far-fetched to expect these developments to 
directly lead to the development and wide- 
spread adaptation of autonomous experi- 
mentation. Therefore, building on the four 
preceding pillars, we might very well witness 
the emergence of the next, fifth pillar of 
science in the not-too-distant future: self- 
driving laboratories throughout the hard 
sciences! Apart from the above-described pro- 
gress, noteworthy key components of such 
self-driving laboratories will include the seam- 
less integration of physics-based QML models 
with robotic hardware and devices to perform 
the various experimental tasks in a laboratory, 
such as the preparation of reactants and in- 
gredients, the execution of the experiment, or 
subsequent purification and analysis of the 
resulting products. Although they are not yet 
deployable for arbitrary chemistries and ex- 
periments, because of their numerical effici- 
ency and reliability, DFT-based QML models 
will likely play a crucial role in achieving this 
integration. 
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EDITORIAL 


New options for patenting in Europe 


nventors seeking to protect their intellectual prop- 
erty in Europe now have an important choice to 
make. More than a decade after its approval, a new 
patent system came into effect last month, which 
some European Union (EU) countries already have 
signed on to. Inventors in these countries can use 
either the new unitary patent (UP) system or the 
previously existing route of European patents. A bal- 
ance between the two paths is hoped to spur a more 
robust pan-European patent landscape. However, the 
new system must be widely adopted first. 

Historically, patent protection was available for Euro- 
pean jurisdictions through national patent systems. Since 
1978, the European Patent Office 
(EPO) has offered a harmonized 
application and examination path 
in the now 39 signatory states to 
the European Patent Conven- 
tion. But as patents granted by 
the EPO ultimately still become 
national patent rights that are 
maintained and litigated nation- 
ally, the system is fragmented and 
costly relative to other regions. 

In 2012, the EU voted in favor 
of a process that contains a Eu- 
ropean patent with unitary effect, 
a language regime, and a single 
patent jurisdiction, the Unified Patent Court. In June, the 
new patent system was initiated in 17 states, including 
Germany, France, and Italy, although the UK, Poland, and 
Switzerland have chosen not (yet) to join. By allowing 
inventors to apply for a single patent that is recognized 
across all UP states, the new system (operated by the 
EPO) will reduce patent owners’ costs. 

In addition, the Unified Patent Court intends to harmo- 
nize the legal treatment of patent rights in UP countries. 
It will have exclusive jurisdiction over both unitary pat- 
ents and European patents. Although all active European 
patents were to be automatically transferred into the new 
system, more than half a million (539,665) such patents 
have, for the moment, opted out, as permitted during a 
transitional period (of up to 12 years). 

Which patent path should inventors choose? A uni- 
tary patent will be attractive to those seeking protection 
in a large number of countries of the European Patent 
Convention. Patent seekers operating in one or two na- 
tional markets are unlikely to want a unitary patent. For 
sectors where international patent portfolios are large, 
such as pharmaceuticals, chemicals, and electronics, the 
new system is a likely choice. Smaller and/or financially 


SCIENCE science.org 


“athe new system... 
raises the specter 


of an even more 
fragmented European 
patent process.” 


constrained actors, such as universities, small and me- 
dium-sized enterprises, and start-ups, may opt for selec- 
tive strategies, seeking unitary patent protection only for 
particularly promising inventions. 

For those who pursue a European patent, yet an- 
other choice must be made—whether to opt out of the 
Unified Patent Court. Opting out would make patents 
subject to national infringement and invalidation de- 
cisions and would avoid having unfavorable decisions 
apply across all member countries of the new system. 
If the number of current patent holders who opt out is 
an early indicator of trust, then it appears that the new 
scheme is being viewed cautiously at best. 

Still, the overall impact of 
the new system is hard to as- 
sess at this point. In general, 
the lower costs of unitary pat- 
ents are likely to invite more 
filings. However, the low patent 
renewal fees (especially during 
the first 10 years) could make 
it more attractive for lower- 
quality inventions to seek pat- 
ent protection. This puts more 
burden on the EPO to weed 
out such applications. The new 
system may also favor so-called 
patent assertion entities who 
could use the threat of pan-European injunctions to 
extract more money from genuine inventors. Strate- 
gic behavior, such as forum shopping, may become 
more pronounced as well. For example, the Unified 
Patent Court, with its large number of court locations, 
allows plaintiffs to choose courts, which may produce 
favorable first-instance rulings. Given the high costs 
of litigation in the new system, these choices can in- 
fluence negotiation positions and outcomes, possibly 
to the detriment of small enterprises, start-ups, and 
research organizations. 

The EPO and the Unified Patent Court will ultimately 
determine the balance between the strength of patent 
rights and patent quality in the new system. Although 
the new system seeks harmonization across countries, 
it unfortunately raises the specter of an even more frag- 
mented European patent process. Although large patent 
portfolio holders will benefit the most from lower fees, 
smaller entities may be more hampered by the increasing 
complexity. Overall, there is a fair chance that the new, 
long-awaited unitary patent system may miss its target 
of providing better protection for high-quality inventions. 

—Reinhilde Veugelers and Dietmar Harhoff 
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&G You're not going to improve by 27%. You may worsen 


a little slower than what you might have expected. 99 


Harvard University neurologist Alvaro Pascual-Leone, in Endpoints News, about the limitations of 
lecanemab, a drug fully approved last week by U.S. regulators for early symptoms of Alzheimer's disease. 
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AGRICULTURE 


A draft law in Europe would 
deregulate gene-edited crops such 
as disease-resistant potatoes. 


Europe considers loosening rules on gene-edited crops 


he European Commission last week proposed 
making it easier to study and commercialize 
gene-edited plants, engineered to have beneficial 
traits such as resistance to climate change, dis- 
eases, and pests. Scientists welcomed the draft 
bill, but it could take the European Parliament 
and Council years to approve the measure. It would 
exempt gene-edited crops from lengthy risk assess- 
ments required for genetically modified crops, with 
traits created by transferring DNA from other species, 
under several conditions. For example, gene-edited 


crops must be equivalent to what conventional plant 
breeding could produce and have no more than 20 
nucleotide base pairs added or replaced. Researchers 
may also add or move the same genes as conventional 
breeders. Like the current rules, the draft law prohibits 
gene-edited plants in organic agriculture and requires 
gene-edited seeds to be labeled. It also prohibits gene 


editing that adds herbicide resistance, reflecting calls — 


in Europe to reduce use of herbicides. If approved, the 
European Union’s policy would largely match those of 
China, the United Kingdom, and the United States. 


U.N. sets emissions goals for ships 


CLIMATE POLicy | The United Nations’s 
International Maritime Organization 
(IMO) last week set a nonbinding target 
for international cargo ships to emit no 
net greenhouse gas emissions by about 
2050. The target is more ambitious than 
IMO’s first climate goal, set in 2018, which 
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called for halving emissions by 2050. If 
the shipping industry achieves the control 
targets, it could help limit global warm- 
ing to no more than 2°C, according to 

an analysis by the International Council 
on Clean Transportation, a think tank. 
But environmental groups complain 

that the shipping goal lacks enforce- 
ment mechanisms. They also point out 


that the warming target exceeds 1.5°C, 

a threshold beyond which scientists 
foresee severe impacts. IMO, which regu- 
lates international shipping through its 
member states, will encourage the design 
of more energy-efficient cargo ships and 
the use of fuels with lower carbon emis- 
sions by 2030. IMO will re-evaluate the 
strategy in 5 years. 
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New Chinese journal gains impact 


PUBLISHING | Just 3 years after launch, 

The Innovation, a China-based, English 
language journal, has made a splash. Last 
month, it notched a citation impact factor 
of 32.1—behind only Nature’s 64.8 and 
Science’s 56.9 among multidisciplinary 
journals—in the annual Journal Citation 
Reports released by the Clarivate analytics 
company. A group of young Chinese 
scientists pooled their savings to get the 
open-access journal up and running, accord- 
ing to the South China Morning Post, which 
first reported the story. Despite producing 
nearly 30% of the world’s reviewed scientific 
papers, China has produced few highly 
ranked journals. At least two-thirds of The 
Innovation’s published papers come from 
China-based corresponding authors. It 
accepts only about 13% of submissions. 


Judge pauses Mexico’s science law 


POLicy | Researchers who have com- 
plained that a new law in Mexico could 
overly centralize the nation’s funding 

and policies for science won this week a 
preliminary court victory, the full impact of 
which was unclear as Science went to press. 
A district judge issued an order that provi- 
sionally suspends the law, enacted in May, 
pending a final court decision. Hundreds of 
scientists had filed legal motions asserting 
that the law violates their rights including 
to intellectual property and freedom of 
research. The new national science agency 
created by the law, the National Council 

for the Humanities, Sciences, Technologies 
and Innovation, says it has not received the 
court decision but will appeal any ruling 
that prevents implementation of the law. 


Cooling speeds orchid evolution 


EVOLUTION | Periods of global cooling 
spurred the rapid speciation of ground- 
dwelling orchids, a study has found, 
suggesting global changes in climate have 
systematically contributed to the genera- 
tion of biodiversity. Researchers examined 
1475 species of Orchidoideae, a subfamily 
that contains more than 3500 species. They 
found that new species did not emerge 
gradually or steadily; instead, the speciation 
rate was much faster during global cooling 
events. Other factors such as latitude and 
elevation did not significantly affect the rate. 
Reporting this week in the Proceedings of the 
National Academy of Sciences, the research- 
ers say these orchids, found on every 
continent besides Antarctica, provide one of 
the clearest examples of long-term changes 
in climate influencing species diversification. 
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NIH ends funding for troubled monkey lab 


he U.S. National Institutes of Health (NIH) has canceled funding eligibility for a 

malaria research facility in Colombia following allegations that research animals 

were abused. The decision in late June about the Caucaseco Scientific Research 

Center came 3 months after an investigation by the animal rights group People for 

the Ethical Treatment of Animals spurred Colombian authorities to close the facility 
and seize more than 100 monkey test subjects, including ones reportedly confined to 
rusted and feces-encrusted cages. The shutdown coincided with a report from the U.S. 
Government Accountability Office that criticized NIH for lax oversight of animal care at 
foreign facilities, including a lack of site visits. The husband-and-wife team that runs the 
facility has received $17.6 million in funding from NIH since 2003. 


The authors say it’s not clear why cooling 
aided speciation in this group of plants, but 
interbreeding between populations may 
have been disrupted during colder periods. 


Alyields designer proteins 


STRUCTURAL BIOLOGY | The artificial intel- 
ligence (AI) program DALL-E has dazzled 
the public by instantly converting text such 
as “teddy bears working underwater” into 
digital images. Now, a new AI program 
offers similar help for researchers seeking 
novel proteins tailored for specific tasks. 
The new program, called RFdiffusion, allows 
researchers to upload a 3D model of a cell 
surface receptor associated with a disease, 
for example. The software can then identify 
the amino acid sequences and shapes of pro- 
teins likely to bind to the receptor, providing 
candidates for drug testing. Compared with 
previous protein design programs not pow- 
ered by AI, RFdiffusion increases accuracy as 
much as 100-fold, a team of U.S. researchers 
reports this week in Nature. That could help 
scientists create proteins useful for a range 


of tasks, such as harvesting atoms of precious 
metals from seawater and extracting carbon 
dioxide from the atmosphere. 


Satellites leak radio waves 


ASTRONOMY | Radiation leaking from 
SpaceX’s internet-beaming Starlink satel- 
lites is polluting parts of the radio spectrum, 
potentially interfering with ground-based 
radio telescopes, a study has found. Using 
the Low Frequency Array (LOFAR) in the 
Netherlands, researchers detected “un- 
intended electromagnetic radiation” from the 
onboard electronics of 47 out of 68 observed 
Starlink satellites, unrelated to internet 
signals or other downlinks, the research 
team reported last week in Astronomy 

& Astrophysics. Radio astronomers had 
previously raised the alarm that Starlink 
downlinks could interfere with protected 
radio bands used to identify organic mole- 
cules in space. With more than 2000 Starlinks 
in orbit and many more planned, the authors 
urged satellite operators and regulators to 
consider these emissions in future designs. 
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Muds in Crawford Lake capture 
humanity’s fossil fuel burning, 
fertilizer use, and atomic bomb fallout. 


Anthropocene’s emblem may be Canadian pond 


Proposal to mark global changes of the 1950s now faces skeptical scientific bureaucracy 


By Paul Voosen 


he Anthropocene—the moment in 

geological time when humanity be- 

came a world-altering force—is a ma- 

jor step closer to formal recognition. 

A group of stratigraphers announced 

this week it will nominate muds from 
Crawford Lake, a 24-meter-deep pond in 
Ontario in Canada, to serve as the “golden 
spike” marking the beginning of a distinct 
Anthropocene epoch. Chemicals and min- 
erals in the sediments clearly capture the 
global “great acceleration” in fossil fuel 
burning, fertilizer use, and atomic bomb 
fallout that began in the 1950s. 

The proposal must now win votes in the 
coming months from three layers of the 
geological bureaucracy, and approval is not 
a given. The transformation the Crawford 
muds capture is not in question; similar 
changes are detectable in lakes, oceans, ice, 
peat bogs, and corals worldwide. “It’s not 
rocket science to see that the planet has 
been operating under different rules than 
before,” says Francine McCarthy, a micro- 
paleontologist at Brock University who led 
work on the proposed site. But geologists 
are deeply divided about whether to recog- 
nize the Anthropocene as a distinct epoch. 

For 14 years, the Anthropocene Working 
Group, established by the International 
Commission on Stratigraphy (ICS), the 
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body that governs the geologic timescale, 
has studied whether and when the world 
has departed the Holocene, the period 
of stable climate that began 11,700 years 
ago at the end of the last ice age, and en- 
tered a new epoch defined by humanity’s 
global influence. In 2016, the group voted 
that such a departure occurred in the 
1950s with postwar economic growth and a 
surge in fossil fuel use (Science, 26 August 
2016, p. 852). 

But the proposal still needed an exem- 
plar, a single geologic deposit that best cap- 
tures the changes, just as a layer of iridium 
between two clays in a Moroccan cliffside 
left by the dinosaur-killing asteroid strike 
marks the end of the Cretaceous period 
66 million years ago. For the past half-decade, 


Crawford Lake sediments feature annual bands 
of white calcite that correspond to human activity. 


12 sites have vied to be the Anthropocene’s 
golden spike (Science, 6 May 2022, p. 562). 

The choice came down to Crawford and 
Sihailongwan Lake, in a protected forest in 
northeastern China. Both lakes are deep, so 
sediments have settled to the bottom and ac- 
crued in undisturbed layers for centuries. But 
Crawford’s sediment bands are thicker and 
feature annual bands of white calcite crys- 
tals, which precipitate out of the lake during 
summer algae blooms fed by nutrients from 
fertilizer runoff and other human activities. 
“Without people peeing and pooping and lur- 
ing geese, living there and impacting the land, 
we don’t see that reliable annual signature,” 
McCarthy says. The muds are also rich in fly . 
ash pollutants and contain trace amounts of 
plutonium released by atmospheric nuclear 
weapons tests, among other signatures. 

In the next few months, the working group 
will submit three proposals for a vote by the 
Subcommission on Quaternary Stratigraphy 
(SQS), the ICS division that oversees the recent 
past. The first names Crawford Lake as the 
golden spike, and the second offers support- 
ing evidence from several other sites, perhaps 
including Sihailongwan. The third and most 
contentious proposes formal recognition of 
the Anthropocene as an epoch. The process 
is highly political, says Hernan Bobadilla, 
a geologist and philosopher of science at the 
Polytechnic University of Milan. “This is not 
science beyond doubt.” 
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While the geologists have deliberated 
about how to define the Anthropocene, other 
scientists, as well as artists, writers, and poli- 
ticians, have already seized on the term. Some 
use it for the start of any human influence 
on Earth, beginning with the dawn of agri- 
culture 12,000 years ago. Others use the term 
to mark changes that began several thousand 
years ago, as humans developed the ability 
to clearcut forests and build huge structures. 
“Everyone wants the Anthropocene for their 
own purposes,” says Martin Head, a Brock geo- 
logist and vice chair of SQS. 

Formalizing the Anthropoceneas an epoch 
will clarify matters, says Colin Waters, a geo- 
logist at the University of Leicester and chair 
of the working group. But Philip Gibbard, 
a geologist at the University of Cambridge 
and secretary-general of ICS, worries that 
by pinning the Anthropocene to the 1950s, 
ICS will turn off many who have taken the 
term to refer to different periods. 

Gibbard and others have instead pro- 
posed that the Anthropocene be considered 
an informal “event,” which does not require 
an exact global starting point in time. “It’s 
more inclusive and more representative of 
what has happened over a long time, as 
people have spread around the Earth,” says 
Dorothy Merritts, a geomorphologist at 
Franklin & Marshall College. 

Tensions are running high. Waters says 
the working group has been told not to 
talk in advance with SQS members, and the 
group’s announcement of Crawford Lake 
was not allowed to happen at the Inter- 
national Congress on Stratigraphy, taking 
place this week in Lille, France, but rather 
at a nearby hotel. Influential members who 
will vote on the proposal have also taken 
prominent stances for and against it: SQS 
is led by two proponents, Head and Jan 
Zalasiewicz, a Leicester paleobiologist who 
first chaired the working group, whereas 
Gibbard, a detractor, is influential at ICS, 
which would vote after SQS. 

If the proposals pass ICS, they would be 
submitted for final approval from the In- 
ternational Union of Geological Sciences, 
which is headed by a longtime opponent, 
Stanley Finney, a stratigrapher at California 
State University, Long Beach. 

It’s possible that the proposal for a sepa- 
rate Anthropocene epoch will fail, but the 
lake’s record of human impact could mark a 
new “Crawfordian” stage within the existing 
Holocene epoch, Waters says. “We just don’t 
know how it will be viewed.” If the submis- 
sion falters, it can’t be proposed again for a 
decade. By that time, some of the resistant 
stratigraphers may move on—but the sig- 
nals of the Anthropocene will remain, Head 
says. “We won’t be going back to the Holo- 
cene envelope. It won’t be just a blip.” 
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Abortion bans and more disrupt 
retention and recruitment 


Amid U.S. political landscape, 


some academics face 


tough choices about where to work 


By Katie Langin 


fter the U.S. Supreme Court over- 
turned the constitutional right to an 
abortion in June 2022, Alesha Thayer 
sat down with her husband and 
worked out a plan to leave the state 
where she has been a faculty mem- 
ber for more than 5 years. Abortions had 
immediately become outlawed there, and 
Thayer, who has experienced miscarriages 
and complicated pregnancies and hopes to 
grow her family, feared the new legal land- 
scape would limit her reproductive auto- 


nomy and the care she could receive. “We 
just can’t be here anymore,” 
the physical scientist says. 
(Thayer is not her real name; 
Science granted anonymity to 
multiple sources for this story 
so they could speak candidly.) 
Thayer is not alone. “I have 
a lot of friends who are in the 
same boat ... trying to get out,” 
she says. Department chairs 
in states that have restricted 
abortion or adopted other 
conservative initiatives tar- 
geting transgender rights or 
diversity, equity, and inclusion 
(DEI) efforts have started to 
receive resignation notices, a Science investi- 
gation has found. “We lost one faculty mem- 
ber who was ready to leave Texas, and two 
others we retained acknowledged that they 
were exploring other schools due to frustra- 
tion with Texas politics,” said a chair in the 
state, which allows its citizens to sue anyone 
who assists an abortion and passed a law last 
month banning university DEI initiatives. 
Such laws do not seem to have made a 
sizable dent in the number of applicants 
for faculty or graduate student positions, 
according to Science’s survey of 40 STEM 
departments in states with restrictive abor- 
tion laws. Of the 21 that responded, only 
four reported a recent drop. Many more re- 
ported increases, especially from prospec- 
tive international students—a change that 
likely reflects the easing of the pandemic. 
Multiple departments found, however, 


“There's 
preat 


Wisconsin 


and, academically 
and intellectually, 
| hate the prospect 

of being limited.” 


Tara Jenson, 
University of 


that some prospective students declined 


offers because of state laws that made them 
feel unwelcome. Even in departments that 
haven’t yet experienced impacts, chairs 
worry about the future. “I can’t imagine 
that this will not have an impact on us re- 
cruiting talent at all levels,” one said. 

Science did not include postdoc positions 
in its survey because hiring is typically 
done by individual labs, not the depart- 
ment, which makes data much harder to 
collect. But some faculty members report 
trouble there as well. “I have had women 
that are interested in potentially doing 
a postdoc with me tell me that they will 
not come to Alabama—I’ve had that said 
multiple times,” says Amy 
Weinmann, an immunology 
professor at the University 
of Alabama at Birmingham 
(UAB) who published an 
opinion piece in Septem- 
ber 2022 discussing how 
the new abortion laws may 
impact women’s health care 
options. She understands 
why postdocs would be 
wary of moving somewhere 
like Alabama, where abor- 
tion is illegal except when 
a pregnant person’s health 
is seriously at risk. For one, 
abortion drugs can be a lifesaving treat- 
ment for someone experiencing pregnancy 
complications. “You can’t just get on a 
plane and deal with that,” she notes. 

In addition to the potential impacts on re- 
cruiting early-career scientists, Weinmann 
worries about leadership positions. When 
UAB’s School of Medicine searched for a 
department chair and a dean position last 
year, a total of nine men and zero women 
were given on-campus interviews—a sign 
that few women may have applied, she says. 
“These demographics would be unheard of 
at a major academic medical center a year 
ago, yet we saw them in two consecutive 
leadership searches.” (UAB Provost and 
Senior Vice President of Academic Affairs 
Pam Benoit says “every leadership search 
we conduct involves a diverse search com- 
mittee and seeks interest from any and all 
qualified candidates.”) 


sO many 
places 


-Milwaukee 
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While the geologists have deliberated 
about how to define the Anthropocene, other 
scientists, as well as artists, writers, and poli- 
ticians, have already seized on the term. Some 
use it for the start of any human influence 
on Earth, beginning with the dawn of agri- 
culture 12,000 years ago. Others use the term 
to mark changes that began several thousand 
years ago, as humans developed the ability 
to clearcut forests and build huge structures. 
“Everyone wants the Anthropocene for their 
own purposes,” says Martin Head, a Brock geo- 
logist and vice chair of SQS. 

Formalizing the Anthropoceneas an epoch 
will clarify matters, says Colin Waters, a geo- 
logist at the University of Leicester and chair 
of the working group. But Philip Gibbard, 
a geologist at the University of Cambridge 
and secretary-general of ICS, worries that 
by pinning the Anthropocene to the 1950s, 
ICS will turn off many who have taken the 
term to refer to different periods. 

Gibbard and others have instead pro- 
posed that the Anthropocene be considered 
an informal “event,” which does not require 
an exact global starting point in time. “It’s 
more inclusive and more representative of 
what has happened over a long time, as 
people have spread around the Earth,” says 
Dorothy Merritts, a geomorphologist at 
Franklin & Marshall College. 

Tensions are running high. Waters says 
the working group has been told not to 
talk in advance with SQS members, and the 
group’s announcement of Crawford Lake 
was not allowed to happen at the Inter- 
national Congress on Stratigraphy, taking 
place this week in Lille, France, but rather 
at a nearby hotel. Influential members who 
will vote on the proposal have also taken 
prominent stances for and against it: SQS 
is led by two proponents, Head and Jan 
Zalasiewicz, a Leicester paleobiologist who 
first chaired the working group, whereas 
Gibbard, a detractor, is influential at ICS, 
which would vote after SQS. 

If the proposals pass ICS, they would be 
submitted for final approval from the In- 
ternational Union of Geological Sciences, 
which is headed by a longtime opponent, 
Stanley Finney, a stratigrapher at California 
State University, Long Beach. 

It’s possible that the proposal for a sepa- 
rate Anthropocene epoch will fail, but the 
lake’s record of human impact could mark a 
new “Crawfordian” stage within the existing 
Holocene epoch, Waters says. “We just don’t 
know how it will be viewed.” If the submis- 
sion falters, it can’t be proposed again for a 
decade. By that time, some of the resistant 
stratigraphers may move on—but the sig- 
nals of the Anthropocene will remain, Head 
says. “We won’t be going back to the Holo- 
cene envelope. It won’t be just a blip.” 
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retention and recruitment 


Amid U.S. political landscape, 


some academics face 


tough choices about where to work 


By Katie Langin 


fter the U.S. Supreme Court over- 
turned the constitutional right to an 
abortion in June 2022, Alesha Thayer 
sat down with her husband and 
worked out a plan to leave the state 
where she has been a faculty mem- 
ber for more than 5 years. Abortions had 
immediately become outlawed there, and 
Thayer, who has experienced miscarriages 
and complicated pregnancies and hopes to 
grow her family, feared the new legal land- 
scape would limit her reproductive auto- 


nomy and the care she could receive. “We 
just can’t be here anymore,” 
the physical scientist says. 
(Thayer is not her real name; 
Science granted anonymity to 
multiple sources for this story 
so they could speak candidly.) 
Thayer is not alone. “I have 
a lot of friends who are in the 
same boat ... trying to get out,” 
she says. Department chairs 
in states that have restricted 
abortion or adopted other 
conservative initiatives tar- 
geting transgender rights or 
diversity, equity, and inclusion 
(DEI) efforts have started to 
receive resignation notices, a Science investi- 
gation has found. “We lost one faculty mem- 
ber who was ready to leave Texas, and two 
others we retained acknowledged that they 
were exploring other schools due to frustra- 
tion with Texas politics,” said a chair in the 
state, which allows its citizens to sue anyone 
who assists an abortion and passed a law last 
month banning university DEI initiatives. 
Such laws do not seem to have made a 
sizable dent in the number of applicants 
for faculty or graduate student positions, 
according to Science’s survey of 40 STEM 
departments in states with restrictive abor- 
tion laws. Of the 21 that responded, only 
four reported a recent drop. Many more re- 
ported increases, especially from prospec- 
tive international students—a change that 
likely reflects the easing of the pandemic. 
Multiple departments found, however, 
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that some prospective students declined 


offers because of state laws that made them 
feel unwelcome. Even in departments that 
haven’t yet experienced impacts, chairs 
worry about the future. “I can’t imagine 
that this will not have an impact on us re- 
cruiting talent at all levels,” one said. 
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in its survey because hiring is typically 
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collect. But some faculty members report 
trouble there as well. “I have had women 
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a postdoc with me tell me that they will 
not come to Alabama—I’ve had that said 
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Weinmann, an immunology 
professor at the University 
of Alabama at Birmingham 
(UAB) who published an 
opinion piece in Septem- 
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the new abortion laws may 
impact women’s health care 
options. She understands 
why postdocs would be 
wary of moving somewhere 
like Alabama, where abor- 
tion is illegal except when 
a pregnant person’s health 
is seriously at risk. For one, 
abortion drugs can be a lifesaving treat- 
ment for someone experiencing pregnancy 
complications. “You can’t just get on a 
plane and deal with that,” she notes. 

In addition to the potential impacts on re- 
cruiting early-career scientists, Weinmann 
worries about leadership positions. When 
UAB’s School of Medicine searched for a 
department chair and a dean position last 
year, a total of nine men and zero women 
were given on-campus interviews—a sign 
that few women may have applied, she says. 
“These demographics would be unheard of 
at a major academic medical center a year 
ago, yet we saw them in two consecutive 
leadership searches.” (UAB Provost and 
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Pam Benoit says “every leadership search 
we conduct involves a diverse search com- 
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For those seeking posts, the calculations 
can be challenging. Faculty positions in 
particular are highly competitive, so lim- 
iting one’s options by ruling out certain 
states may be a significant career risk. At 
the Ph.D. and postdoc level, too, research 
opportunities can be difficult to come by, 
and junior researchers have no control 
over where their dream lab may be. 

But at least some are choosing to prioritize 
state law in their searches. “There’s so many 
great places and, academically and intellec- 
tually, I hate the prospect of being limited,” 
says Tara Jenson, an environmental health 
epidemiologist who recently completed 
her Ph.D. at the University of Wisconsin- 
Milwaukee. But she and her wife don’t want 
to live in a place that restricts the rights of 
women and those in the LGBTQ+ commu- 
nity, so she won’t even consider postdoc or 
faculty jobs in some states. “They could offer 
me ... an amazing deal and I still would be 
like, ‘Nope, I will settle for less at a place that 
I feel safe and protected.” 

Mae Guthman, a neuroscience postdoc 
at Princeton University who is applying 
for faculty positions, feels similarly. She 
is interested in developing novel forms of 
gender-affirming hormone therapy—work 
that could be affected by some recently 
enacted state laws that restrict what med- 
ical care options are available for trans- 
gender people. “It’s definitely impacting 
the institutions I apply to.” Guthman—who 
is transgender—is also weighing her own 
health care needs and those of people she’d 
like to recruit to her lab. “Ideally, I would 
like to have ... trans trainees,” she says, 


“as well as birthing-capable trainees and 
trainees of color.” So, she wants to be in 
a place where all such potential lab mem- 
bers feel welcome and safe. 

Many department chairs in states with 
restrictive laws feel that their hands are 
tied. “We can control the atmosphere 
within the department and the values that 
the department expresses,” one chair said. 
But they can’t change what health care op- 
tions are available in their state or influence 
how the state is perceived by the outside 
world. Speaking about Texas’s DEI bill, one 
chair noted it “sends a message [that] we’re 
not an inclusive environment.” 

At public universities, administrators 
face additional constraints as state employ- 
ees, which limits what they are allowed to 
say. “To have a university that hasn’t really 
talked about this, from my perspective, 
is dehumanizing,” says a faculty member 
who works at a public university in a state 
that bans abortions. Last year—after more 
than 10 years at her current institution— 
she made the difficult decision to apply for 
faculty jobs elsewhere. “I actually have a 
written offer from another university,” the 
biologist says. “My department chair does 
not know.” 

As for Thayer, she’s still on the job mar- 
ket. “I’m waiting for the right fit to come 
along,” she says. “It’s hard at my stage. I 
have a very expensive lab. I have established 
my career already. It will be terribly disrup- 
tive to move.” But she’s determined to make 
it happen—so much so that if she doesn’t 
land another faculty job within the next few 
years, she’s prepared to leave academia. & 


In states where abortion is restricted, some professors have resigned and others are weighing their options. 
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Giant NSF 
grants aim to 
revitalize left- 
behind regions 


34 teams make first cut for 
$160 million awards to drive 
economic and social change 


By Jeffrey Mervis 


ara Hudiburg worries she may be “out 

of her league” in vying for the most lu- 

crative competitive grant the National 

Science Foundation (NSF) has ever 

awarded. But the forest ecologist at 

the University of Idaho may be under- 
estimating herself. Last month, her team’s 
proposal to build a “smart climate economy” 
in the five-state Columbia River Basin was 
chosen as one of 34 semifinalists for NSF's 
first five Regional Innovation Engine (RIE) 
grants to be awarded this fall. 

The $160 million RIE grants are the flag- 
ship initiative of NSF’s new technology di- 
rectorate. They are designed to broaden the 
agency’s traditional focus on fundamental 
research to include turning discoveries into 
new industries and training workers for the 
resulting high-tech jobs. The grants are also 
supposed to improve economic conditions 
in what NSF calls “those regions of America 
that have not fully participated in the tech- 
nology boom of the past several decades.” 

Many lawmakers believe NSF hasn’t done 
enough to address those disparities, pointing 
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out that five states receive almost 40% of its . 


research dollars. So Congress will be watch- 
ing closely to see whether the agency does a 
better job of spreading the wealth with this 
first cohort of winners. Hoping to reassure 
lawmakers, NSF opted for maximum trans- 
parency by identifying the semifinalists, a 
sharp break from tradition for proposals that 
are still under review. 

Some applicants fear it isn’t going far 
enough. For 4 decades NSF has run the Es- 
tablished Program to Stimulate Competi- 
tive Research (EPSCoR), which is designed 
to help smaller and more rural states boost 
their research capacity. But institutions in 
EPSCoR states such as Idaho are leads for 
only eight—fewer than one-quarter—of the 
RIE semifinalists, whose proposals span 
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grants aim to 
revitalize left- 
behind regions 


34 teams make first cut for 
$160 million awards to drive 
economic and social change 


By Jeffrey Mervis 
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are still under review. 

Some applicants fear it isn’t going far 
enough. For 4 decades NSF has run the Es- 
tablished Program to Stimulate Competi- 
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their research capacity. But institutions in 
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A team from academia and industry hopes to win a grant to lower carbon emissions from Louisiana’s oil and chemical industries. 


14 topical areas ranging from agriculture to 
sustainable energy to wellness. That means 
Hudiburg’s team, which includes two tribal 
nations and a community college, must also 
compete against 26 proposals from 14 non- 
EPSCoR states, many led by top 20 research 
universities and major academic medical 
centers with large companies as partners. 

One is headed by the world-renowned 
Cleveland Clinic and includes two major re- 
search universities, hospitals, and Medtronic, 
a leader in medical devices and therapeutics. 
That team proposes to build and operate a 
comprehensive digital platform to improve 
health care outcomes in the Cleveland and 
Detroit areas. Although neither Ohio nor 
Michigan is an EPSCoR state, neurologist 
Lara Jehi, the clinic’s chief research informa- 
tion officer and co-leader on the proposal, 
notes that “Ohio and Michigan are among 
the worst nationally for health equity.” 

But some contenders believe having such 
juggernauts vie for the prize isn’t consistent 
with the goals of the program. “The idea is 
not to award more money to wealthy insti- 
tutions for work that they’re already doing,” 
says Michaele Armstrong, associate direc- 
tor of a regional hub for biotech startups at 
Washington State University that is part of 
the Idaho proposal. “Instead, an NSF Engine 
is supposed to help mobilize technologies or 
communities that have been left behind.” 

Some EPSCoR states contending for an 
NSF Engine are also home to well-resourced 
research institutions. A semifinalist team led 
by the University of Alabama at Birmingham 
(UAB), whose research portfolio ranks in the 
top 5% nationally, wants to use biotechnology 
to drive innovation and reduce health inequi- 
ties plaguing the region’s sizable Black popu- 
lation. UAB hopes to do that by partnering 
with some of the 14 historically Black colleges 
and universities (HBCUs) in the state. 
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“Alabama has the largest number of 
HBCUs in the country, and we also have a 
strong track record of recruiting minority 
patients in clinical trials,’ says Kathy Nugent, 
a biotech entrepreneur who oversees com- 
mercialization efforts at UAB and is leading 
the team. “But reducing health inequities is a 
hard problem, and the validation and visibil- 
ity that comes with winning an NSF Engine 
would really help.” 

Another EPSCoR-state semifinalist, based 
at Louisiana State University (LSU), aims to 
address another urgent societal problem: the 
massive carbon footprint of the state's oil, 
gas, and chemical industry. “If we can figure 
out a cheap energy source, whether that’s so- 
lar, nuclear, or wind, and then find a way to 
use its energy to transport and sequester our 
greenhouse gas emissions, we can essentially 
still be home to a very hydrocarbon intense 
industry,’ says Andy Maas, director of LSU’s 
office of innovation and commercialization, 
who leads the team. 

NSF expects the winning teams to also at- 
tract significant funding from both the pub- 
lic and private sectors. LSU should have no 
trouble satisfying that goal, Maas says: The 
big oil companies have announced some 
$27 billion in new low-carbon projects in the 
region, he notes, and New Orleans recently 
won $50 million in federal infrastructure 
funding to pursue green hydrogen. 

At the core of every proposal is a plan to 
carry out and commercialize the research 
conducted by the team, then train a local 
workforce needed for the relevant business 
sector to thrive. Typically, commercializa- 
tion involves helping a faculty member 
start a company or license their intellectual 
property. But a semifinalist based at the 
University of Texas at El Paso (UTEP)—in 
a state not eligible for EPSCoR—is propos- 
ing a new model that will allow small and 


medium-size companies in the aerospace 
and defense sector to order up the advances 
they need to prosper. 

“Most of those companies cannot afford 
the innovation that comes from a major uni- 
versity or a big corporate lab,’ says Ahsan 
Choudhuri, a UTEP aerospace engineer who 
leads the team. “So we're offering innovation 
as a service. We’ll provide them with the in- 
frastructure, talent, and innovation to rebuild 
America’s defense industrial base.” He calls it 
“a very grassroots approach.” 

Every RIE proposal must also include 
plans to increase diversity, equity, and inclu- 
sion (DEI) in the U.S. research enterprise. 
That could be a challenge in states, such as 
Texas, which have ordered universities to roll 
back or shutter such efforts. But like the other 
semifinalists from such states who spoke 
with Science, Choudhuri does not anticipate 
any problem in satisfying that requirement. 

“DEI is in our DNA,’ Choudhuri says, cit- 
ing the ethnic and gender diversity of his 
leadership team. “Hispanics make up 80% of 
our community, so we don’t have to set up an 
office to ensure diversity. By supporting us, 
you are helping to solve the problem.” 

NSF chose the first round of semi- 
finalists from 188 proposals and will win- 
now the list further after site visits. Each of 
the five winners will receive $15 million for 
2 years, with the promise of $145 million 
more over a decade. (The number and size 
of subsequent cohorts will depend on the 
agency’s future overall budget.) 

Hudiburg hopes to be one of the winners. 
But she won’t think of herself as a loser if her 
team’s proposal comes up short. 

“The connections we’ve made are perma- 
nent,” she says. “We have a dream, and if we 
don’t get an NSF engine, we'll keep looking 
for other funding. Because the atmosphere 
can’t wait.” 
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India plans to invigorate science 
with hefty new funding agency 


Proposed National Research Foundation would spend 
$6 billion over 5 years, but is drawing mixed reactions 


By Pratik Pawar 


he Indian government has announced 
an ambitious plan to create a new 
National Research Foundation (NRF) 
that would pump $6 billion into re- 
search over 5 years. But it is drawing 
mixed reviews from scientists. 

Although few details are available, many 
researchers are welcoming the proposal 
to create a new “apex body” for Indian 
research, saying it could help boost the 
nation’s relatively meager investments in 
basic and applied science. But some worry 
the foundation might fall prey to political 
interference. They are also skeptical it will 
be able to attract the envisioned funding, 
70% of which is supposed to come from 
private industry. That goal is likely “a little 
unrealistic,” says policy specialist Shailja 
Vaidya Gupta, a former senior science ad- 
viser to the government. 

The NRF proposal, unveiled on 28 June by 
India’s Union Council of Ministers, is rooted 
in a 2019 report requested by Prime Min- 
ister Narendra Modi. The report, from the 
Prime Minister’s Science, Technology and 
Innovation Advisory Council, noted that In- 
dia lagged far behind other large nations in 
the share of gross domestic product (GDP) 
it spends on research and in the quality of 
research papers and patents it produces. 
Many Indian universities, it added, do not 
conduct any research. 
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To help reverse those trends, the report 
called for a powerful new research agency, 
similar to the National Science Foundation 
in the United States, which would help co- 
ordinate science policy and consolidate and 
expand funding, including peer-reviewed 
grants to academic researchers. The agen- 
cy’s annual budget should equal 0.1% of 
India’s GDP, or more than $2 billion at the 
time, it said. To protect the agency from po- 
litical pressure and bureaucratic red tape, 
the report recommended that an indepen- 
dent board of prominent scientists choose 
its leaders, and that it operate with “com- 
plete autonomy ... so that it does not face 
hurdles in funding good projects.” 

The new proposal departs substantially 
from that vision. It calls for the prime 
minister to preside over NRF’s board, for 
example, and for the government’s science 
and education ministers to fill two other top 
jobs. The science ministry and India’s sci- 
ence adviser will also help oversee NRF and 
its governing board. 

Having the prime minister occupy such a 
senior role “tells you the seriousness [with] 
which we want to take the R&D ecosystem,” 
says Ajay Sood, the government’s princi- 
pal science adviser. And researchers say 
the structure means the foundation could 
prosper under politicians who support its 
mission. “It’s not a bad idea for the prime 
minister to be at the helm,’ says Partha 
Majumder, founder of the National Insti- 
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Anew funding agency could boost science progrz upde 
at Indian universities, such as the University of Dem. 


tute of Biomedical Genomics and a member 
of India’s Science and Engineering Research 
Board (SERB), created in 2008 in an earlier 
effort to address funding struggles. 

NRF’s links to powerful politicians 
could also help it attract industry support, 
Majumder says. Senior government officials 
“can probably talk to industry much more 
effectively than a bunch of us,” he says. 

But the arrangement could expose NRF to 
political meddling, warns the Breakthrough 
Science Society, an advocacy group. It notes 
that Modi’s government has sometimes pro- 
moted fringe, pseudoscientific ideas—such 
as claims that ancient Hindus invented 
aircraft and the internet. That “cannot but 
make scientists apprehensive of the future 
direction and fate of science research under , 
an NRF with such political leaders at its 
helm,” the society said in a statement. 

India’s notoriously ponderous bureau- 
cracy could pose another threat to the new 
agency, says Gupta, who worked on the 
NRF proposal in 2020 when she was a gov- 
ernment official. Because the foundation 
would not be an independent agency, it will 
“have to adhere to rules which apply across 
sectors,” she says. “So, if you’re building a 
bridge, or you're discovering new proteins, 
you are treated the same way.” 

The plan calls for NRF to subsume SERB. 
Ultimately, the government says the foun- 
dation will receive more public money than 
it gave to SERB. But, overall, officials are 
counting on the private sector to contribute 
more than $4 billion of NRF’s funding from 
2023 through 2028. 

Sood acknowledges that “these are aspi- 
rational numbers.” But he says they reflect 
the government’s desire “to bring other peo- 
ple [on board and] make them co-partners 
in the R&D and innovation ecosystem.” 

Still, others are skeptical. “Banking on + 
the corporate sector to pitch in witha sum . 
exceeding [the] government commitment 
is a wild dream,’ the Breakthrough Science 
Society said. 

It’s been difficult to fully assess the NRF 
proposal, researchers say, because the gov- 
ernment will not publicly release the leg- 
islation creating the foundation until it is 
passed by India’s parliament. “It shouldn’t 
be this way ... [the draft bill] should be up 
for public debate before we could put it up 
into the parliament,’ Gupta says. 

Action on the bill could come as early as 
this summer, with Parliament scheduled to 
hold its Monsoon Session from 20 July to 
11 August. 
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Pratik Pawar is a journalist in Bangalore, India. 
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Scientists mourn Arecibo’s impending shutdown 


SCIENTIFIC COMMUNITY 


Astronomers scramble to find work as storied observatory faces uncertain future 


By Claudia Lopez Lloreda, 
in Arecibo, Puerto Rico 


t the end of a winding road lined by 
flowering red flamboyan trees, you 
can still see the broken remains of 
the world’s largest single-dish radio 
telescope. The Arecibo Observatory’s 
305-meter telescope collapsed in 
2020, and in October 2022 its owner, the 
National Science Foundation (NSF), an- 
nounced that after 6 decades, the site would 
no longer be an observatory. Instead, NSF 
plans to transform it into an educational fa- 
cility. Contracts for up to 90 staff members, 
including 25 researchers, end on 14 August. 

“We're closing out all science activities,” 
says physicist Julie Brisset of the University 
of Central Florida (UCF), who’s in charge 
of maintenance and research at Arecibo. To 
scientists who have worked at the observa- 
tory, used its data, or mentored local stu- 
dents, it is a painful transition. “It’s very sad,’ 
says Brazilian physicist Pedrina Terra dos 
Santos, who has worked at the observatory for 
17 years and, like many of her colleagues, is 
still seeking a new job. 

Arecibo was once central to astronomy; 
just a few weeks ago, astronomers relied on 
archived data from its giant radio telescope 
in their report on the long-sought hum of 
gravitational waves from supermassive black 
holes. Even after engineering failures and 
poor maintenance brought down the big 
scope, Arecibo continued to observe the skies 
with its remaining instruments, including a 
12-meter radio telescope and an optical lab 
for probing atmospheric particles and winds. 

In October 2022, NSF announced it would 
not rebuild the giant telescope, saying it was 
following community recommendations for 
the best use of scarce research dollars. It is 
now shutting down most of the smaller in- 
struments as well. As scientists depart, “all 
the expertise associated with instruments is 
leaving,’ Brisset says. Olga Figueroa-Miranda, 
director of the observatory, says people from 
UCF, Puerto Rico’s Metropolitan University, 
and Yang Enterprises, an engineering firm, 
will be let go, including herself. She has yet to 
find a new position. 

Angel Vazquez, an engineer and director of 
telescope operations at Arecibo, found a new 
job with the National Radio Astronomy Ob- 
servatory but worries others may not be as 
lucky. Many will have to leave the island to 
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pursue their careers. “There is no way that I 
can stay,’ dos Santos says. Her son, who was 
born in Puerto Rico, once dreamed of work- 
ing at Arecibo. 

The exodus is likely to contribute to Puerto 
Rico’s severe brain drain, and will deplete 
the pool of mentors for student research 
programs and hands-on training like EN- 
CANTO, a program for undergraduates from 
underrepresented groups. “The pipeline of 
Hispanic scientists that came through the ob- 
servatory is unmatched,” says Robert Kerr, a 
physicist who was director of the observatory 
for 6 years. “I don’t know how exactly we re- 
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NSF is banking on using the site’s visitor 
center and facilities in what will be called the 
Arecibo Center for STEM Education and Re- 
search (ACSER). Officials say they hope the 
new center will match what the observatory 
did for education in Puerto Rico by offering 
student field trips, summer programs, and 
exhibits. “We want to embrace more of the 
scientific significance of the area,” such as its 
ecology, says Monya Ruffin, a deputy division 
director at NSF. The agency expects to award 
funds to one proposal by the end of the year. 

Ubaldo Cérdova Figueroa, a professor at 
the University of Puerto Rico at Mayagiiez, 


The 2020 collapse of Arecibo’s giant radio telescope has upended some scientists’ careers. 


place that.” This year ENCANTO was reduced 
from 2 years to one; its future is uncertain. 

NSF says it’s still open to proposals for 
scientists to visit Arecibo and use the instru- 
ments that remain, such as the 12-meter 
telescope. And for the site of the big scope, 
scientists including Brisset recently proposed 
a smaller telescope. But former Arecibo Di- 
rector Mike Nolan, now an astronomer at the 
University of Arizona, says the area around 
the giant telescope is “going to be just a hole 
in the ground. ... The jungle will come back.” 

For now, researchers are scrambling to 
get their last bits of data, archive them, and 
move instruments before they have to leave. 
“We have 1 month to get the data,” physicist 
Anish Damodaran says of solar observations 
he made with the 12-meter telescope. Kerr 
has about $1 million worth of instruments in 
the optical lab that he must move back to the 
United States. 


has submitted a proposal with more than 
30 partners. He envisions two programs, one 
focused on education and one on research 
that includes ecology, such as genetic studies 
of endemic Puerto Rican species. His pitch 
calls for science camps for students, trainings 
for teachers, and 3D printing workshops at 
the former observatory’s machine shop. 

But Nolan wonders how effective such pro- 
grams can be if no astronomers or physicists 
are on site and students can’t see science be- 
ing done. And Cordova Figueroa worries that 
even if he wins the award, ACSER’s funding, 
totaling $5 million over 5 years, may not be 
enough for the programs he envisions. 

Nine months after the announcement 
of the transition, Arecibo’s future remains 
murky. On 14 August, “we are going to shut 
off the lights and put the locks on the doors,” 
Vazquez says. “We don’t know what will hap- 
pen after that.’ = 
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PALEOCLIMATE 


Lava outburst may have led to Snowball Earth 


Eruption dates suggest chemical weathering of rocks triggered plunging temperatures 


By Maya Wei-Haas 


bout 717 million years ago, a climate 

catastrophe struck the planet, as 

temperatures plunged and glaciers 

enveloped the globe. The cause of 

this “Snowball Earth” episode has 

been mysterious, but it took place 
around the same time as a massive out- 
burst of volcanism. Many researchers 
thought there might be a connection. But 
the timing was uncertain. 

Now, more precise dates, reported last 
month in Earth and Planetary Science 
Letters (EPSL) and in November 2022 in 
Science Advances, show the eruptions pre- 
ceded the Snowball Earth event by 1 mil- 
lion to 2 million years. The lag points to 
a particular way the fire could have trig- 
gered the ice: through a chemical altera- 
tion of the fresh volcanic rocks known as 
weathering, which sucks carbon dioxide 
(CO,) from the atmosphere, turning down 
the planetary thermostat. The studies 
highlight the power of weathering as a key 
driver behind shifts in Earth’s climate, and 
how components of the planet as disparate 
as rocks and the atmosphere are inextri- 
cably linked, says EPSL study co-author 
Galen Halverson, a sedimentary geologist 
at McGill University. “Nothing can be un- 
derstood in isolation.” 

Geoscientists debating the cause of the 
so-called Sturtian glaciation, which lasted 
57 million years, have pointed to a number 
of possibilities—meteorite strikes, biologic 
activity, shifts in Earth’s orbit, and more. But 
recent studies have zeroed in on one of the 
largest volcanic outbursts ever, preserved 
today across northern Canada in what’s 
called the Franklin large igneous province 
(LIP). The eruptions spewed lava across 
an area at least the size of Argentina— 
and perhaps bigger than China. 

Volcanism can trigger cooling in two 
main ways. In one, eruptions release 
sulfur-rich gases, which form aerosols that 
block sunlight and cool the planet. A sec- 
tion of the Franklin lava likely even burst 
through rocks full of sulfur-rich minerals 
that could have supercharged the plumes. 
The other mechanism is weathering. Lava 
rocks are particularly susceptible to the re- 
actions, in which CO, in rainwater reacts 
with the rocks, ultimately forming miner- 
als that precipitate in the ocean. 
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“The difference between all these propos- 
als is the timescales on which they happen,” 
says Judy Pu, lead author of the Science 
Advances study at the University of Cali- 
fornia, Santa Barbara. Sulfur aerosols, for 
example, only linger in the atmosphere for 
months to years. Weathering, however, is 
much slower, requiring 1 million to 2 mil- 
lion years before the cooling effects peak. 

Yet until recently, the estimated timing 
for the Franklin LIP spanned more than 
10 million years, making it impossible to 
discern which of the two mechanisms 


Some 717 million years ago, glaciers covered the 
globe. Earlier eruptions may have triggered the event. 


could have been the trigger. Improvements 
in dating—and the two teams’ sheer deter- 
mination to recover rare minerals to date— 
are helping finally pin down the timing. 
“Everything starts with good ages,” says 
Ashley Gumsley, a geologist specializing in 
LIPs at the University of Silesia in Kato- 
wice who was not part of the study teams. 
By measuring the ratios of trace amounts 
of uranium and lead trapped in tiny crys- 
tals of the mineral zircon—and knowing 
how fast uranium decays to lead—the 
teams discovered that the Franklin LIP 
formed in just 2 million years or so, much 
faster than most previous estimates. And 
the primary pulse of volcanism happened 
1 million to 2 million years before Snow- 
ball Earth, which has been dated through 
analysis of rocks scoured up by glaciers 
and eventually deposited in the ocean. 
That’s exactly the sort of time frame re- 
quired for weathering to cause the cooling. 


“They’re just stunning the extent to which 
[the dates] all line up,” Halverson says. Ad- 
ditional analysis by Pu and her colleagues 
suggest the Franklin LIP formed a broad 
volcanic highland that would have been 
battered by wind and water, speeding up 
the weathering. 

Pu says other factors may have dialed up 
the weathering enough to cause a Snow- 
ball Earth event. For starters, all of Earth’s 
landmasses at the time were located near 
the equator, where temperatures were 
warm and rain frequently pelted the sur- 
face. The floods of lava also emerged 
during the breakup of the Rodinia 
supercontinent, which exposed fresh rocky 
surfaces to weathering and may have al- 
ready begun to cool the global climate. 
And Gumsley says other large eruptions 
probably took place at this time in Siberia, 
China, Africa, and Antarctica, adding to 
the overall weathering effect. 

Case closed? Not for Harvard University 
geologist emeritus Paul Hoffman, a co- 
author on the Science Advances study who 
led much of the early work on Snowball 
Earth. Even if the timing of the Franklin 
LIP has improved, that of the global gla- 
ciation remains slightly uncertain, he says. 
Forming the ice-scoured rock that marks 
the start of the episode requires the flow 
of thick ice at sea level, a process that may 
not have started until several hundred 
thousand years after the oceans froze over. 

Other Snowball periods remain mysteri- 
ous. Some 650 million years ago, not long 
after the Sturtian glaciation, the planet 
plunged into another deep freeze—but no 


volcanic outburst preceded it, notes Linda . 


Sohl, a paleoclimatologist at Columbia 
University who was not part of the study 
teams. Still, Halverson says weathering 
may have been a key factor then, too, be- 
cause all the scouring by glaciers during 
the earlier Snowball would have exposed 
fresh rock. 

For the Sturtian glaciation at least, the 
case for a weathering trigger has grown, 
Sohl says. “In the very least, a revisit with 
weathering models seems in order,’ she 
says. “The snowball glaciations are such fas- 
cinating events that are going to keep all of 
us quite busy for years to come.” 


Maya Wei-Haas is a science journalist in 
Washington, D.C. 
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IEA PIGBOTS 


Doing research with human subjects is costly and cumbersome. 
Can AI chatbots replace them? 


By Matthew Hutson; /!/ustrations by Ashley Mastin 
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or Kurt Gray, a social psychologist 
at the University of North Carolina 
at Chapel Hill, conducting experi- 
ments comes with certain chores. 
Before embarking on any study, 
his lab must get ethical approval 
from an institutional review board, 
which can take weeks or months. 
Then his team has to recruit online 
participants—easier than bringing people 
into the lab, but Gray says the online sub- 
jects are often distracted or lazy. Then the 
researchers spend hours cleaning the data. 
But earlier this year, Gray accidentally saw 
an alternative way to do things. 

He was working with computer scientists 
at the Allen Institute for Artificial Intelli- 
gence to see whether they could develop an 
AI system that made moral judgments like 
humans. But first they figured they’d see 
if a system from the startup OpenAI could 
already do the job. The team asked GPT- 
3.5, which produces eerily humanlike text, 
to judge the ethics of 464 scenarios, previ- 
ously appraised by human subjects, on a 
scale from -4 (unethical) to 4 (ethical)— 
scenarios such as selling your house to fund 
a program for the needy or having an affair 
with your best friend’s spouse. The system’s 
answers, it turned out, were nearly identical 
to human responses, with a correlation coef- 
ficient of 0.95. 

“T was like, ‘Whoa, we need to back up, 
because this is crazy,” Gray says. “If you 
can just ask GPT to make these judgments, 
and they align, well, why don’t you just ask 
GPT instead of asking people, at least some- 
times?” The results were published this 
month in Trends in Cognitive Science in an 
article titled “Can AI Language Models Re- 
place Human Participants?” 

Generative language models, as these AI 
systems are known, have taken the world by 
storm. Perhaps the best known is OpenAI’s 
series of GPT models, which power the 
ChatGPT chatbot. But other major tech 
companies, including Google and Meta, are 
plowing resources into their own models. 
After being trained on massive amounts of 
text from books and web pages, these mod- 
els have an uncanny ability to mimic verbal 
human behavior. They have already found 
use in writing computer code, summarizing 
legal documents, and powering chatbots 
that tutor students or conduct therapy. 

Now, researchers are considering Al’s 
ability to impersonate human subjects in 
fields such as psychology, political science, 
economics, and market research. No one is 
yet suggesting that chatbots can completely 
replace humans in behavioral studies. But 
they may act as convenient stand-ins in pilot 
studies and for designing experiments, sav- 
ing time and money. Language models might 
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also help with experiments that would be too 
impractical, unethical, or even dangerous 
to run with people. “It’s a really interesting 
time,” says Ayelet Israeli, a marketing profes- 
sor at Harvard Business School who believes 
the models’ impact on behavioral research 
could amount to a “revolution.” “Some of 
these results are just astonishing.” 


IN HIS ETHICS study, Gray was using GPT- 
3.5 as a sort of collective everyman, in the 
hopes of soliciting an average human re- 
sponse. But such models can also be used 
to populate panels with strikingly diverse 
participants, because they can be prompted 
to play anyone: A model contains multi- 
tudes. Last year, researchers at Brigham 
Young University (BYU) created what they 
call “silicon samples,” simulations of hu- 
man samples. In one study, they fed GPT-3 
information about an adopted guise, includ- 
ing age, gender, race, education level, and 
political affiliation. When the researchers 
left one of these variables out and asked 
the model to fill it in, its answers closely 
matched those from a survey of voters. The 
researchers also found the model spat out 
political speech that matched its assigned 
party affiliation. Lisa Argyle, a BYU politi- 
cal psychologist and co-author on the study, 
wants to use the virtual participants to test 
questions for online surveys, identifying 
those that are most likely to be revealing. 
That could make actual surveys more ef- 
ficient. “This is important because survey 
samples are growing both more expensive 
and less representative,” she says. 

Language models can also adopt personal- 
ity archetypes. In a study led by Hang Jiang, 
a computer scientist at the Massachusetts 
Institute of Technology (MIT), researchers 
had GPT-3.5 assume hundreds of personas 
by prompting it to behave with different 
combinations of personality traits—for ex- 
ample, introverted, antagonistic, conscien- 
tious, neurotic, and closed to experience. For 
each persona, they had the model complete 
a standard personality test and write an 
800-word childhood story that was then 
analyzed for psycholinguistic features as- 
sociated with personality traits. The models 
dutifully manifested their assigned person- 
alities both on the test and in the stories. 
Jiang says such models might allow re- 
searchers to test, say, how well people with 
different personalities would perform in 
various jobs. 

Market researchers are already finding 
value in the models. In a recent study, Israeli 
and colleagues found that GPT-3.5 seemed to 
display realistic consumer behavior. When 
asked whether it would buy a laptop at 
various prices, it was less sensitive to price 
when told its income was $120,000 versus 


$50,000. It preferred whatever toothpaste 
brand it had bought previously, and it would 
pay less for yogurt if it already had a lot at 
home. It also said it would pay realistic pre- 
miums for certain product attributes, such 
as toothpaste with fluoride and deodorant 
without aluminum. 

The model didn’t always give the same an- 
swers but instead offered a range of responses 
about its preferences and willingness to pay. 
Israeli and her colleagues aggregated its 
many responses, building a virtual customer 
survey for these token products for a fraction 
of the time and money it would have taken in 
the real world. Language model training data 
are biased toward Western, affluent people, 
so the consumer survey might be similarly 
skewed. But Israeli imagines prompting the 
AI to impersonate a range of consumers—or 
to zoom in on a particular demographic—in 
order to create a more representative study 
of a product’s appeal or potential. 

One market research company is already 
putting language models to work. The 
startup Synthetic Users has set up a service 
using OpenAI models in which clients— 
including Google, IBM, and Apple—can de- 
scribe a type of person they want to survey, 
and ask them questions about their needs, 
desires, and feelings about a product, such 
as a new website or a wearable. The com- 
pany’s system generates synthetic interviews 
that co-founder Kwame Ferreira says are 
“infinitely richer” and more useful than the 
“bland” feedback companies get when they 
survey real people. 


CHATBOTS CAN ALSO be pitted against one 
another to study more complex human 
interactions. Last year, researchers at 
Stanford University and Google developed 
“social simulacra” for studying user be- 
havior on platforms such as Facebook and 
Reddit. The researchers populated a plat- 
form they called SimReddit with the equiv- 
alent of 1000 different users, by repeatedly 
prompting GPT-3 with a user identity, a 
community topic, community rules, and 
previous posts to the forum. Humans had 
a hard time distinguishing the resulting 
discussions from real ones, and platform 
designers found the tool useful for creating 
rules or moderation practices. 

This year, the researchers built a more 
immersive simulation populated with what 
they call “generative agents.” The charac- 
ters were given the ability to remember 
experiences, reflect on them, and gener- 
ate and execute plans. Organized behavior 
emerged: the researchers gave one agent 
the idea to throw a Valentine’s Day party, 
and over 2 days all the agents in town co- 
ordinated to throw one. Joon Sung Park, 
a Stanford computer science graduate stu- 
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dent who led both projects, says the virtual 
world could be used to study the effect of 
economic policies over time before impos- 
ing them on real people. 

Economists and psychologists have used 
agent-based models for many years, pro- 
gramming both the agents and the rules of 
engagement. But the simulations tend to be 
simple and depend on hand-coded theoreti- 
cal assumptions. John Horton, an economist 
at the MIT Sloan School of Management who 
has done related work, says agents based on 
language models are more realistic. He imag- 
ines simulating thousands of job seekers and 
hiring managers to test labor market regula- 
tions. “That would be pretty wild,” he says. 


AI will progress quickly in any case. “It is 
plausible that we will have a system within 
a few years that can just be placed into any 
experiment and will produce behavior in- 
distinguishable from human behavior.” 

A critical question is whether language 
models will not just reproduce existing 
findings, but generalize and predict new 
ones. When models appear to match pub- 
lished psychology studies, they could be 
regurgitating training data in response to 
memorized questions. As a result, many re- 
searchers are taking pains to phrase ques- 
tions in novel ways. 

Another lingering issue is whether mod- 
els reflect what people would actually do or 


It is plausible that we will have a 
system within a few years that can just 


be placed into any experiment and 
will produce behavior indistinguishable 
from human b 


DESPITE ALL their apparent capabilities, the 
language models are by no means perfect 
human mirrors. They display several classic 
human biases but not others. For instance, 
one recent study of GPT-3.5 found that, 
like humans, it tends to overestimate how 
widespread its opinions are in the general 
population, a bias known as the false con- 
sensus effect. But unlike humans, the model 
showed little hesitation in taking risks and 
tempting fate. Marcel Binz, a cognitive sci- 
entist at the Max Planck Institute for Bio- 
logical Cybernetics, says AI might need to 
physically interact with the world to exactly 
mimic human participants; it’s hard to 
learn all the nuances of intelligent behavior 
just through passive reading. But he thinks 
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just what they say they’d do. People often 
lie to researchers—and even themselves. 
Synthetic Users co-founder Hugo Alves 
suspects the models state true preferences, 
because they’re trained partially on the na- 
kedly honest material contained in anony- 
mous discussion forums. “I’ve asked things 
in parenting forums that I wouldn’t ask a 
friend,” he says. 

Horton worries that the unguarded re- 
sponses may not last, as OpenAI and oth- 
ers guide their models to be safer and less 
offensive. “The push to make these models 
more aligned and not say bad stuff is kind 
of contrary to social science,” he says. “Real 
people aren’t nice all the time. Real people 
say racist, sexist stuff” 


FOR NOW, synthetic participants are most 
useful for piloting experiments, researchers 
say. If a model gives unexpected answers to 
survey questions or doesn’t respond at all, 
Argyle says, your questions may be hard to 
understand and need rewriting. Israeli says 
you can design a survey with 1000 ques- 
tions and use language models to narrow 
them down to those most likely to correlate 
with an outcome of interest. Similarly, in 
economics experiments, Horton says you 
could run 1 million bargaining scenarios 
with a model to identify the factors that 
most affect behavior—before launching the 
study with people. “The simulation kind of 
gave you a map,” he says. 

You could also run experiments you 
wouldn’t ever want to do with people. The 
1963 Milgram experiment, in which par- 
ticipants obeyed orders to deliver what they 
thought were increasingly strong electric 
shocks to an unseen second set of subjects, 
probably wouldn’t pass ethical review to- 
day. But it was easy enough for Gati Aher, 
an undergraduate computer science student 
at Olin College of Engineering, to replicate 
the infamous study with GPT-3. She and her 
colleagues found that, like the people in the 
original experiment, the model didn’t begin 
to let off the button until 300 volts. 

Aher thinks the models could provide 
guidance in other sensitive arenas that are 
difficult to study, for example, what to say 
to a person who is suicidal. Gray says re- 
searchers could study ostracism, or the role 
of negative feedback on self-esteem. Or, he 
says, they could study dehumanization of the 
kind seen in the Vietnam War’s My Lai mas- 
sacre by describing the situation and asking 
a model what it would do—provided the 
models aren’t too sanitized. 

Argyle says she doesn’t know of anyone 
yet who has replaced human participants 
with language models. “To be honest, this is 
all still pretty much a hypothetical,” she says. 
“First we have to demonstrate the language 


models can do the work.” But Horton believes. 


the shift is inevitable. It reminds him of a 
similar transformation a decade ago, when 
many social science experiments moved from 
in-person to online surveys. “People were 
like, ‘How can you run experiments online? 
Who are these people?’ And now it’s like, ‘Oh, 
yeah, of course you do that:” 

Chatbots might already be infiltrating 
online surveys—but among subjects rather 
than researchers. A recent study asked 
crowdworkers to summarize some text and 
found that at least one-third were likely us- 
ing ChatGPT. Gray says, half-jokingly, “If on- 
line participants are already using GPT, we 
might as well just ask GPT itself,” = 


Matthew Hutson is a journalist in New York City. 
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Genetic editing of wood for sustainability 


Trees engineered to have less lignin could make paper production less polluting 


By Vania G. Zuin Zeidler 


ignin, a polymer formed by phenyl- 

propanoid units, is responsible for the 

rigidity and resistance of the lignocel- 

lulosic cells in wood (7). In conventional 

pulp production, lignin must be cleaved 

and dissolved under alkaline conditions 
or first sulfonated to make it soluble so that fi- 
ber separation can take place. Delignification 
processes are reagent and energy intensive, 
leading to costly chemical recovery (2). Pulp 
treatment methods to remove wood extrac- 
tives such as lignin have been developed, but 
they are not yet economically viable at an in- 
dustrial scale (3). On page 216 of this issue, 
Sulis et al. (4) present a multiplex CRISPR 
genome editing strategy to modify lignin bio- 
synthesis genes and reduce the lignin content 
of Populus trichocarpa, a species of poplar. 
This approach could provide a solution to a 
key operational constraint in the paper and 
pulp industry. 

Around 50 million tons of lignin are gener- 
ated per year as a by-product in the paper and 
pulp industry (5). In 2021, the industry pro- 
duced greenhouse gases with a global warm- 
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ing potential equivalent to 190 million tons 
of carbon dioxide, a figure that is projected to 
increase in the next decades (6). In addition, 
almost 1.7 billion tons of feedstocks and rea- 
gents were used to produce around 820 mil- 
lion metric tons of final chemical products in 
2015, while also producing around the same 
quantity of chemical waste (7). These statis- 
tics reveal resource and process inefficiencies 
that have resulted in substantial chemical 
releases to environmental compartments, in- 
cluding the atmosphere, soil, water, and ani- 
mal and plant life (8). 

Most commercially valuable woods are 
genetically complex, which makes it diffi- 
cult to determine which genes contribute to 
the complex phenotypes and associate them 
with causal mechanisms, such as lignin bio- 
synthesis (9). Therefore, recently developed 
bioengineering approaches such as multiplex 
CRISPR genome editing that allow multi- 
ple genes to be edited in a single effort—a 
goal that was more difficult to achieve with 
previous approaches—are likely to become 
important for developing a new generation 
of woody biorefinery crops (10, 11). Indeed, 
multiplex genome editing of other plants has 
already been used to modify traits such as 
herbicide resistance, hormone biosynthesis, 
and metabolic function (12). 


Sulis et al. identified target genes for their 
study with a previously established computa- 
tional predictive model for monolignol bio- 
synthesis in poplar trees (13). The oxidative 
polymerization of three major monolignols is 
the predominant method of lignin formation 
in flowering plants such as poplar (14). The 
aim of the model was to identify genetic mod- * 
ifications that would result in reduced lignin : 
content and increased syringyl-to-guaiacyl 
and_ carbohydrate-to-lignin ratios. Seven 
strategies involving the modification of three + 
to six genes each and with the best predicted . 
fiber trait improvements were selected for 
multiplex editing in poplar wood. 

A multiplex CRISPR construct was assem- 
bled for each strategy and delivered to pop- 
lar trees using Agrobacterium tumefaciens, 
generating 174 independent lines of edited 
poplar wood. The wood of a 6-month-old 
wild-type poplar tree contained 20.9% lignin 
and had a carbohydrate-to-lignin ratio of 
3.0, whereas in edited wood, lignin content 
was reduced by up to 49.1%, and the carbo- 
hydrate-to-lignin ratio was increased by up 
to 228%. However, it should be noted that 
several of the edited lines with substantial 
reductions in lignin content also had reduced 
tree growth, demanding additional studies 
to establish the impact of such a change on 
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Wood from Populus trichocarpa, a variety 
of poplar tree, is used to manufacture paper, 
a process that can be highly polluting. 


the pulp and paper industry. These stud- 
ies should also assess wood density and the 
performance of edited trees in actual planta- 
tions, with a higher number of replicates per 
line. In Europe and North America, poplar 
has become an important source of wood 
for pulp and paper, but it is not the only tree 
used for this purpose. In the future, it would 
be interesting to apply the approach taken by 
Sulis et al. to other wood varieties. 

Sulis et al. modeled the techno-economic 
impacts of CRISPR-edited wood on kraft 
pulping and found that yield and produc- 
tion efficiency were increased, resulting 
in reduced need for pulping chemicals. 
Additionally, they estimated that pulp pro- 
duction using edited wood would reduce the 
global warming potential of the process by 
up to 20%. These changes would be a step 
toward a more sustainable fiber production 
system, which would align with the United 
Nations Sustainable Development Goals (15). 

In the European Union, engineered trees 
such as those produced by Sulis et al. are con- 
sidered genetically modified organisms and 
must comply with the directive 2001/18/EC. 
This means that the European regulations 
would limit, or even forbid, the market in- 
troduction of gene-edited trees. However, 
some suggest that there is no reason to dif- 
ferentiate gene-edited trees with small dele- 
tions or insertions from the ones produced 
by conventional breeding or mutagenesis and 
that, to face the current challenges related to 
environmental impacts, biosafety regulations 
should be based on the specific risks and ben- 
efits of each product (9). 
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Metabolic control of 


transcription 


Light alters histone methylation in plants via 
nuclear a-ketoglutarate dehydrogenase 


By Eric B. Taylor 


echanisms that link environmen- 

tal stimuli to organismal tran- 

scriptional responses facilitate ad- 

aptation and survival. Because it 

signals energy availability through 

photosynthesis, light is among the 
most important environmental signals for 
plants. On page 179 of this issue, Huang 
et al. (1) demonstrate that, in the plant 
Arabidopsis thaliana, light exposure stim- 
ulates chromatin-local depletion of the 
tricarboxylic acid (TCA) cycle metabolite 
a-ketoglutarate to orchestrate global gene 
expression by impairing demethylation of 
nucleosomal histone tails. These findings 
provide a molecular mechanism for how 
the Arabidopsis genome senses light. They 
also demonstrate how subcellular metabo- 
lism can be spatially reorganized to regu- 
late chromatin structure and transcription. 

Methylation of histone-tail lysine (K) 
residues is a conserved mechanism for both 
activating and repressing gene expression 
(2, 3). Nuclear DNA wraps around histones 
to form chromatin, and modification of 
histone-tail lysines alters chromatin struc- 
ture and modulates interaction with tran- 
scription control machinery. Methylation 
(me) patterns are diverse, including mono 
(mel)-, di (me2)-, and tri (me3)-methyla- 
tion, with lysines on histone 3 (H3) known 
to be especially important for regulating 
gene expression. 

Jumonji C domain-containing histone de- 
methylases (JMJs) are a major class of his- 
tone demethylases that use a-ketoglutarate 
as a rate-limiting substrate to remove methyl 
groups from histone lysines by oxidation 
(4-6). In the TCA cycle, a-ketoglutarate is 
produced by isocitrate dehydrogenase (IDH) 
and catabolized by a-ketoglutarate dehy- 
drogenase (KGDH) to succinyl-coenzyme 
A. Using subcellular fractionation, light mi- 
croscopy, and immunoprecipitation, Huang 
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et al. showed that after light exposure, a 
small fraction of KGDH translocates into the 
nucleus and physically interacts with JMJs 
(see the figure). 

KGDH comprises three subunits: E1, E2, 
and E3. Huang et al. found that compared 
with wild-type cells, whole cellular genetic 
disruption of either El or E2 subunits 
decreased the abundance of H3K4me3, 
H3K9me2, H3K27me3, and H3K36me3 
methylation marks. This decrease was ac- 
companied by late flowering of the plant 
and accumulation of antioxidant anthocy- 
anin pigments in seedlings. RNA sequenc- 
ing revealed broad changes in expression 
of genes related to abiotic and biotic stress 
responses, which is consistent with per- 
turbed sensing of the environment. 

Because of its central role in cellular 
metabolism, genetic ablation of KGDH 
could lead to distal, nonspecific stress re- 
sponses that affect histone methylation 
and gene expression. Thus, a key chal- 
lenge faced by Huang et al. was to parse 
global effects of metabolic stress from 
those specific to loss of nuclear a-ketoglu- 
tarate. Through experiments in which the 
genes encoding KGDH EI and E2 subunits 
were deleted and then added back with or 
without mutations, they determined that 
regulation of gene expression by KGDH 
depends on nuclear localization conferred 
by the nuclear localization sequence in the 
E2 subunit. 

Among genes that might explain the ob- 
served late flowering and anthocyanin ac- 
cumulation, the central repressor of flow- 
ering FLOWERING LOCUS C (FLC) and 
master regulator of anthocyanin biosyn- 
thesis PRODUCTION OF ANTHOCYANIN 
PIGMENT I (PAPI) genes were both dere- 
pressed by genetic KGDH disruption. The 
authors performed tests of causality by 
mutating either FLC or PAPI in the pres- 
ence of KGDH disruption. These secondary 
mutations largely rescued late flowering 
and anthocyanin accumulation, respec- 
tively. Through this result, Huang e¢ al. 
show that, although many metabolic sig- 
naling events clearly evolved as feedback 
mechanisms to tightly maintain metabolic 
homeostasis (7, 8), metabolic signaling out- 
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of poplar tree, is used to manufacture paper, 
a process that can be highly polluting. 


the pulp and paper industry. These stud- 
ies should also assess wood density and the 
performance of edited trees in actual planta- 
tions, with a higher number of replicates per 
line. In Europe and North America, poplar 
has become an important source of wood 
for pulp and paper, but it is not the only tree 
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Sulis et al. to other wood varieties. 
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in reduced need for pulping chemicals. 
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those specific to loss of nuclear a-ketoglu- 
tarate. Through experiments in which the 
genes encoding KGDH EI and E2 subunits 
were deleted and then added back with or 
without mutations, they determined that 
regulation of gene expression by KGDH 
depends on nuclear localization conferred 
by the nuclear localization sequence in the 
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PIGMENT I (PAPI) genes were both dere- 
pressed by genetic KGDH disruption. The 
authors performed tests of causality by 
mutating either FLC or PAPI in the pres- 
ence of KGDH disruption. These secondary 
mutations largely rescued late flowering 
and anthocyanin accumulation, respec- 
tively. Through this result, Huang e¢ al. 
show that, although many metabolic sig- 
naling events clearly evolved as feedback 
mechanisms to tightly maintain metabolic 
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a-Ketoglutarate dehydrogenase prevents histone demethylation 

In cells of the plant Arabidopsis thaliana, light exposure stimulates the translocation of a small portion of 
a-ketoglutarate dehydrogenase (KGDH) to the nucleus, where it interacts with Jumonji C domain-containing 
histone demethylases (JMJs). JMJs remove methyl! groups from histone lysines using a-ketoglutarate (aKG) as 
a substrate. Nuclear KGDH depletes nuclear «KG, preventing histone demethylation and altering transcription. 
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puts do not necessarily require intact me- 
tabolism. Thus, late flowering arose from 
altered metabolite signaling, not a primary 
metabolic impairment. 

Several JMJ demethylase families 
are found across eukaryotic species (9). 
Because individual JMJs have different af- 
finities and selectivities for methylated his- 
tone lysines, it is possible that regulated, 
JMJ family member-specific association 
with KGDH confers locus-specific con- 
trol. In support of this possibility, through 
chromatin immunoprecipitation-quantita- 
tive polymerase chain reaction (PCR) ex- 
periments, Huang et al. observed the spe- 
cific regulation of H3K4me3, H3K9me2, 
H3K27me3, and H3K36me3 methylation 
status by KGDH. They also observed se- 
lective occupancy of different KGDH-JMJ 
complexes within the gene bodies, 5’-tran- 
scribed regions, or both of different genes. 

Several important questions will need to 
be addressed in future research. The sig- 
nal transduction mechanism by which light 
stimulates nuclear KGDH translocation re- 
mains undefined. Deciphering the signaling 
transducers of light exposure could reveal 
additional inputs from nonlight environ- 
mental stimuli that converge on common 
signaling hubs to increase nuclear KGDH 
activity. In this case, the precision of tran- 
scriptional responses to discreet stimuli 
could be provided by additional, event-spe- 
cific signals that modulate the composition 
of KGDH-JMJ and other regulatory protein 
complexes. Furthermore, mapping the full 
light signal transduction pathway could also 
provide opportunities for future research 
to test whether KGDH and other signaling 
mechanisms may be genetically modulated 
to enhance plant growth in low light. 
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Although Huang et al. showed that as 
in Arabidopsis, human KGDH and JMJs 
physically interact, the human E2 KGDH 
subunit lacks the nuclear localization se- 
quence present in Arabidopsis that the 
study demonstrated is required for nu- 
clear KGDH translocation and repression 
of histone demethylation. Thus, future 
work will be required to test whether this 
phenomenon persists in mammals, with 
KGDH nuclear translocation potentially 
regulated by a different mechanism. This 
is intriguing, because although aberrant 
mitochondrial metabolism in cancer sys- 
tems has been shown to increase levels 
of succinate and 2-hyroxyglutarate, which 
inhibit JMJ activity (70, 11), a mechanism 
for activating mammalian JMJs has not 
been identified. 
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Phage protein E shuts 
down cell wall biosynthesis 
to kill bacteria 
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o release infectious progeny, many 
bacteriophages induce lysis of the host 
bacterium. One intriguing mechanism 
involves the single-gene lysis (SGL) 
phage proteins (J). In contrast to anti- 
microbial peptides, most of which in- 
teract directly with the bacterial membrane 
to form membrane pores (2), SGL proteins 
induce the host cell to undergo autolysis. 
Expression of the 91-residue transmembrane 
protein E from the archetypal phage ®X174 
is sufficient to induce autolysis in Escherichia 
coli (3). On page 180 of this issue, Orta et 
al. (4) report the structure of protein E in 
complex with host cell proteins MraY (phos- 
pho-MurNAc-pentapeptide translocase) and 
SlyD (sensitive to lysis D). Protein E enlists 
SlyD to inhibit MraY and thus peptidoglycan 
biosynthesis. Ensuing defects in the cell wall 
impair resistance to turgor pressure, leading 
to host cell lysis. Phages engineered to ex- 
press protein E could serve as an antibacte- 
rial therapeutic strategy. 
Almost 40 years ago, the ability of protein 
E to induce bacteriolysis was shown to be de- 
pendent onthe presence ofahostcytoplasmic . 
protein, SlyD (3). SlyD exhibits three distinct 
chaperone activities: The FKBP (FK506 bind- 
ing protein) domain acts as a peptidyl-prolyl 
cis-trans isomerase (5, 6), the IF (“insert in 
flap”) chaperone domain stabilizes protein- 
folding intermediates through donor-strand 
complementation (7), and the C-terminal 
histidine-rich sequence is implicated in 
metalloprotein assembly (8). However, un- 
til now, none of these activities could be 
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correlated with SlyD’s role in bacteriolysis. 

The 34-residue transmembrane N-terminal 
domain (NTD) of protein E is necessary, but 
not sufficient, for inducing lysis (3); this 
requires the presence of the 57-amino acid 
cytosolic C-terminal domain (CTD), which 
varies in sequence among ®X174-related 
phages. Notably, replacement of the CTD 
with an autonomously folding domain re- 
sults in lysis even in the absence of slyD. 
Screening of lysis-inducing protein E mu- 
tants in a slyD-null background revealed 
the involvement of a second host protein, 
MraY (9). This dimeric integral membrane 
enzyme (10) catalyzes a crucial step in cell 
wall biosynthesis, namely formation of the 
prenylated peptidoglycan precursor lipid I, 
suggesting that protein E acts by disrupt- 


(11) shows that the kinks allow the peptide 
inhibitor to block access to both the lipid and 
the nucleoside binding sites simultaneously, 
effectively supressing substrate binding. The 
CTD of protein E, which is required for lysis in 
wild-type E. coli, is held in place by two SlyD 
molecules. An amphiphilic a helix of protein 
E traverses the cytoplasmic face of MraY par- 
allel to the membrane, sandwiched between 
the enzyme’s active site and the IF domain 
of one SlyD molecule. The peptide then pro- 
ceeds in an extended conformation to bind 
to the IF’ domain of the second SlyD’ mole- 
cule through donor-strand complementation, 
ending with Pro® occupying the second SlyD’ 
FKBP’ domain active site. Together with a sec- 
ond protein E’ moiety, both active sites of the 
MraY dimer are blocked. It is remarkable that 
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ing the integrity of the bacterial cell wall. 

Orta et al. pull these disparate strands 
together to elucidate the role of each player 
in bacteriolysis. Coexpression of FE. coli 
MraY with the ®X174 E protein resulted 
in a stable complex that copurified with FE. 
coli SlyD, which the authors dubbed the 
YES complex (MraY-E-SlyD). Using single- 
particle cryo-electron microscopy, the struc- 
tures of YES,,.,,, and a related complex could 
be traced to single-residue precision, with the 
tripartite complex forming a dimer of trimers 
(see the figure). 

The study of Orta et al. reveals a surpris- 
ingly simple yet elegant principle behind 
protein E action. The NTD hugs a groove in 
the surface of one monomer of MraY in a hel- 
ical conformation, with kinks at conserved 
prolines in protein E. A comparison with the 
carbacaprazamycin-inhibited MraY structure 
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there are no direct contacts between MraY 
and SlyD or between the two chaperone mol- 
ecules SlyD and SlyD’, with protein E mediat- 
ing all interactions between the host proteins. 

Thus, phage protein E couples two func- 
tionally unrelated bacterial proteins, MraY 
and SlyD. This supresses MraY substrate 
binding, which inhibits peptidoglycan bio- 
synthesis and thereby initiates host cell lysis. 
The chaperone function of SlyD is co-opted 
to stabilize the CTD of protein E, preventing 
cellular degradation of its (presumably) oth- 
erwise disordered C terminus. Why should 
protein E have evolved to hijack SlyD rather 
than make use of a folded CTD? The answer 
lies in the structure of the phage genome: 
The gene encoding protein E is embedded as 
an alternative reading frame in the gene of 
scaffolding protein D. Maintaining the integ- 
rity of the protein D gene puts limitations on 


the genetic space available to that of protein 
E. Indeed, the necessity to encode multiple 
functional proteins in a restricted genome is 
reflected in the diversity found in SGL target 
proteins (including MurA, which catalyzes 
the first committed step in peptidoglycan 
biosynthesis, and MurJ, the conserved lipid 
II flippase) and accessory proteins (including 
the chaperone DnaJ) (7). Overall, this sug- 
gests that lytic functions have arisen sponta- 
neously and opportunistically. 

It is not only phages that have co-opted the 
potential of SlyD. Fusion of one or two copies 
of the chaperone to the viral envelope pro- 
teins HIV gp41 (72) and rubella virus E1 (73) 
facilitates high-yield recombinant protein 
production in F. coli. The SlyD fusion pre- 
vents aggregation of the viral-antigen protein 
fragments, maintaining antibody binding 
competency. This has fostered the commer- 
cialization of diagnostic kits for serological 
quantification of antiviral antibodies in pa- 
tients. This procedure was used to stabilize 
the severe acute respiratory syndrome coro- 
navirus 2 (SARS-CoV-2) receptor binding do- 
main of the Spike protein, which facilitated 
the rapid development of the double-antigen 
sandwich assay Elecsys Anti-SARS-CoV-2-S 
for determining antibody titers in patients af- 
ter SARS-CoV-2 infection or vaccination (J4). 

More than a century ago, phage pioneer 
Félix Hubert d’Hérelle recognized the pos- 
sibilities afforded by phage-induced bacte- 
riolysis in the treatment and prevention of 
bacterial infections (75). Eclipsed after World 
War II by the advent of antibiotics such as 
penicillin, research into the therapeutic and 
prophylactic use of bacteriophages continued 
in the USSR. The ever-increasing emergence 
of antibiotic-resistant bacterial strains has 
reawakened interest in phage therapy. How- 
ever, bacteriophage use is limited by their 
narrow strain specificity, so that applications 
tend to require mixtures of phages. Under- 
standing the molecular mechanism of ®X174 
protein E raises the possibility of designing 
lytic phages—or equipping nonlytic phages 
with lytic genes—for the targeted eradication 
of antibiotic-resistant bacteria. 
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RETROSPECTIVE 


Donald D. Brown (1931-2023) 


Father of molecular embryology 


By Steven McKnight! and Allan Spradling? 


onald D. (“Don”) Brown, ground- 

breaking biologist who studied iso- 

lated genes in test tubes, died on 31 

May. He was 91. The technique Brown 

pioneered, which he called “develop- 

mental genetics by gene isolation,” 
allowed him to, for the first time, purify an 
animal gene and elucidate its structure and 
regulatory mechanism. The advent of unlim- 
ited gene purification by recombinant DNA 
cloning generalized Brown’s approach. This 
gene-centric strategy unleashed a scientific 
revolution that unified genetics, developmen- 
tal biology, and cell biology. 

Born on 30 December 1931 in Cincinnati, 
Ohio, Brown attended Dartmouth College 
for 3 years and then earned his MD and an 
MS in biochemistry from the University of 
Chicago in 1956. After a yearlong internship, 
he spent 2 years as a National Institutes of 
Health fellow with neuroscientist Seymour 
Kety. He conducted postdoctoral studies with 
biochemist Jacques Monod at the Pasteur 
Institute in Paris and then moved to the 
Department of Embryology at the Carnegie 
Institution of Washington in Baltimore. He 
remained there for the rest of his career, be- 
coming director of the department in 1976 
and retiring with emeritus status in 2005. 

When Brown was in medical school, em- 
bryo development was scarcely covered in the 
curriculum. He recognized that developmen- 
tal biology was not yet a field but rather a set 
of questions. After the structure of the DNA 
double helix was published in 1953, Brown 
made it his scientific mission to merge DNA 
research, biochemistry, and embryology. He 
had the drive (as his opponents on the tennis 
court would attest), the self-confidence, and 
the courage to undertake this task. 

Life at the Pasteur Institute inspired 
Brown, as work on bacterial operons (gene 
clusters regulated together) climaxed. He 
thrived in the lunchtime company of such 
luminaries as eventual Nobel laureates 
Monod, Francois Jacob, and André Lwoff. 
In contrast to the Pasteur Institute, Brown 
described the 1960 Carnegie Embryology 
Department as an “old-fashioned histology 
laboratory, or perhaps a museum.” Yet, in 
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its atmosphere of intellectual freedom and 
ample funding, he flourished. 

The role of RNA as a direct product of 
genes had recently been recognized, and 
Brown decided to analyze RNA synthesis in 
frog embryos. He and John Gurdon found 
that a frog mutant lacking nucleoli pro- 
duced neither of the major ribosomal RNAs 
(rRNAs), indicating that nucleoli were the 
normal site of rRNA production. Wondering 
why frog oocytes contained many more nu- 
cleoli than did frog somatic cells, Brown and 
Igor Dawid showed that the number of rDNA 
genes greatly increased (“amplified”) during 
oogenesis to support the massive ribosome 
production needed for each oocyte, a discov- 
ery independently made by Joseph Gall. 


The amplification and unusual base com- 
position of rDNA made it possible to purify 
these genes according to their specific den- 
sity, a feat first accomplished by Max Birnstiel 
in Geneva. Brown purified the genes encod- 
ing the smaller, more easily studied 5S rRNA 
genes and found that they could be accurately 
transcribed in vitro. Using recombinant DNA 
methods, Brown isolated plasmids contain- 
ing the frog 5S genes, making them the first 
eukaryotic genes to be cloned. These meth- 
ods led to Brown’s astonishing discovery that, 
unlike bacterial operons, 5S gene transcrip- 
tion was controlled by sequences internal 
to the gene. Soon thereafter, he and Robert 
Roeder found that this control region binds 
a transcription factor, which they designated 
TFIIIA. The first gene-specific transcription 
factor described for eukaryotic cells, TFHIA 
became the founding member of the zinc- 
finger class of protein domains. 

Brown naturally grasped the motivations 
and idiosyncratic differences among young 


scientists, allowing him to find and me ie 
raw talent. As director, he transformed “ure 
Embryology Department into a place re- 
nowned for fostering faculty creativity and 
training future leaders. Brown hired dis- 
tinctive individuals and gave them free rein 
to chart the department’s course. His role 
was to support and encourage faculty so as 
to bolster the creativity of their research. 
“Life is too short not to focus on your best, 
most original ideas,’ he insisted. Those who 
worked there, including both of us, enhanced 
every aspect of their scientific capabilities. 
Generations of former students and postdoc- 
toral fellows credit their long and successful 
careers to their time in Brown’s department. 

Brown made a department with only eight 
faculty intellectually larger by hiring young 
staff associates, mostly new PhDs, who were 
fully independent and adequately funded 
for 5 years. The success of Carnegie staff as- . 
sociates motivated multiple institutions to 
launch similar fellows programs. Countless 
first-time visitors remarked that the depart- 
ment was much smaller than they expected. 
A review committee once reported that it had 
detected no trace of administration in the de- 
partment, a comment Brown—who took on 
the bulk of the administrative duties himself 
to prevent disruption to his staff—considered 
high praise. One of us (A.S.) succeeded Brown 
as director of the department in 1994 and 
continued to promote “the Carnegie style” 
that his department epitomized. 

Brown favored interactions that were short, 
clear, and direct, whether in scientific discus- 
sion, written communication, or personnel 
evaluation. He would open meetings with 
a brief, riveting address that was insightful, 
humorous, and unconventional. His advice 
was sought by many institutions, including 
the Howard Hughes Medical Institute. His 
shrewd, unexpected suggestions were often 
highly influential. 

Brown received multiple awards for his 
scientific achievements and leadership, in- + 
cluding the 2012 Lasker-Koshland Special . 
Achievement Award in Medical Science 
(shared with Tom Maniatis) for seminal con- 
tributions to understanding and manipulat- 
ing genes and for mentoring young scientists. 
A champion of basic biological research who 
saw youth asits seed corn, Brown tirelessly em- 
powered young scientists. He founded and for 
more than four decades nurtured perhaps his 
most fitting legacy: the Life Sciences Research 
Foundation, which has supported more than 
700 postdoctoral researchers. Don Brown ad- 
vanced the metamorphosis of developmental 
biology into a true discipline and made the 
scientific enterprise as a whole a more per- 
fect servant of our highest ideals. 
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RESPONSIBLE RESEARCH 


Researchers’ responsibility 
to uphold Indigenous rights 


Too often research brings harm to Indigenous peoples 


By Lawrence Ignace’, Lauren Burton!, Sara 
Mynott'2, Mairi Meehan’, Erica Olson’, Jade 
Steel’, Jaime Ojeda", Sarah Harper’, Luisa 
Ramirez’, Dana Baker’, Laurel Sleigh’, Caitie 
Frenkel’, Chris Rhodes’, Natalie C. Ban? 


he United Nations Declaration on the 
Rights of Indigenous Peoples (UND- 
RIP), the most comprehensive inter- 
national human rights instrument 
concerning Indigenous peoples, was 
adopted in 2007 by 144 countries to 
ensure protection of Indigenous rights and 
self-determination (J). While directed at 
states, UNDRIP should guide all levels of 
society. With governments lagging in its im- 
plementation, we argue that researchers—at 
universities, government institutions, consul- 
tancies, and elsewhere—have a responsibility 
to understand and advance these rights. As 
Indigenous and non-Indigenous researchers 
with backgrounds and experiences partner- 
ing with Indigenous peoples, we reviewed 
the articles of UNDRIP to identify those rel- 
evant to researchers. We summarized these 
into four themes: self-determination; free, 
prior, and informed consent; intellectual 
property; and engagement and _ learning. 
These articles provide a starting point for re- 
searchers to engage with and become allies 
in upholding Indigenous rights, in a way that 
supports Indigenous self-determination and 
sovereignty now and into the future. 

Since 1948 the Universal Declaration of 
Human Rights (UDHR) has outlined funda- 
mental rights to which all people are inher- 
ently entitled. Yet despite some benefits of 
the UDHR, such as influencing the devel- 
opment of international human rights law, 
its focus is on individuals, leaving many 
gaps for Indigenous peoples, who, despite 
UDHR, continue to be subjected to coloni- 
zation, dispossession, discrimination, and 
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genocide. Although there are jurisdictions 
that recognize Indigenous rights in law, 
such as in Bolivia and New Zealand, this 
is far from universal, and rights in law are 
often not adequately enforced. In response 
to all of this, leaders of Indigenous com- 
munities increasingly pressed for a distinct 
document that would go above and beyond 
UDHR and emphasize and enshrine unique 
rights to self-determination, lands, terri- 
tories and resources, culture, identity, and 
language. The culmination of this massive 
effort was the UNDRIP. 

Indigenous peoples have inherent rights 
“which derive from their political, economic, 
and social structures, and from their cul- 
tures, spiritual traditions, histories and phi- 
losophies, and especially their rights to their 
lands, territories and resources” (7). Although 
UNDRIP does not go so far as recognizing 
Indigenous sovereignty, it does summa- 
rize the minimum standard of rights for 
Indigenous peoples to be respected by states. 
UNDRIP is important because it attempts 
to right wrongs; equalize the uneven power 
relationship that resulted from colonialism, 
exclusion, and systemic racism; and navigate 
toward potential futures that uphold the 
rights and sovereignty of Indigenous peoples. 
UNDRIP is unique because it comprehen- 
sively articulates Indigenous rights relevant 
to these aims, unlike other agreements that 
affect, but are not explicitly focused upon, 
Indigenous peoples (e.g., Nagoya Protocol on 
Access and Benefit Sharing; Convention on 
Biological Diversity). 

Implementation of UNDRIP by states 
varies. Some countries (e.g., Ecuador) have 
aligned their constitutions with UNDRIP, and 
others have applied it to respect Indigenous 
rights in court cases (e.g., Belize, Chile), 
whereas others are lacking any implementa- 
tion. Researchers thus have a responsibility 
to act now to uphold UNDRIP because too 
often research continues to bring harm to 
Indigenous peoples and their territories. How 
research is conducted can have implications 
for Indigenous rights, self-determination, and 
sovereignty, especially when research takes 
place on Indigenous lands and waters and in- 


volves Indigenous peoples, their knowle sid 
wisdoms, cultural items, etc. (2). Some-ca-— 
amples of violations of Indigenous rights by 
researchers include using biological samples 
for secondary research purposes without con- 
sent (3), and biopiracy of Indigenous knowl- 
edge for commercial exploitation (4). More 
subtle practices of scientific colonialism in- 
clude non-Indigenous researchers implying 
ownership with phrases such as “my study 
sites” and “my Indigenous community,’ not 
including original (Indigenous) names when 
referring to species and places, or claiming 
to have made “discoveries” while ignoring 
Indigenous knowledge. Such practices are 
common, amplifying the need for research- 
ers to ensure that Indigenous peoples have 
the foresight, power, and authority to deter- 
mine what, how, and why research happens 
in their territories. 

UNDRIP implementation by research- . 
ers exists within three interrelated spheres 
of control, influence, and interest (5). 
Researchers hold power, privilege, and re- 
sponsibility as they have control over where, 
when, how, and why their research is carried 
out, throughout all stages in the research 
process, at the project and program levels 
(5). Whereas these aspects of research are 
within the direct control of researchers, oth- 
ers can be influenced, but not controlled, 
such as how research is conducted at the 
institutional level (e.g., changing knowledge, 
attitudes, and skills through interpersonal 
relationship building; contributing to or 
chairing committees; and supporting insti- 
tutional governance reform). Beyond their 
spheres of control and influence lies their 
sphere of interest, where decisions within 
the sphere of control (e.g., sharing research 
findings with practitioners and policy-mak- 
ers) interact with governance and policy de- 
cision-making and can affect concrete social, 
economic, and environmental changes. How 
much researchers can affect the spheres of 
control, influence, and interest depends on + 
the individual and the autonomy of their 
position (e.g., a research assistant may have 
little control, whereas a university president 
can shape policy). 


RESEARCH-RELEVANT THEMES 

The researcher-relevant themes that we iden- 
tified in UNDRIP are especially important 
to Indigenous peoples because of the long- 
standing attempts by colonizers and settlers 
to erase their inherent rights and delegiti- 
mize their worldviews and relationality. 


Self-determination 

A core theme in UNDRIP is that “Indigenous 
peoples have the right to self-determination” 
(Article 3) and includes Indigenous peoples’ 
right to self-government (Article 4), main- 
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taining and strengthening their institutions 
(Article 5), the right to practice and revital- 
ize cultural traditions and customs (Article 
11.1, and a right to participate in decision- 
making (Article 18). This means that uphold- 
ing UNDRIP requires conducting research 
in a way that allows Indigenous peoples to 
determine what and how research happens 
in their territories. Indigenous peoples draw 
upon origins from their worldviews, lands, 
and creation stories, which form their moral 
and societal responsibilities to the environ- 
ment and their communities (6, 7). The impli- 
cation for research in support of Indigenous 
self-determination will be the acceptance 
and manifestation of Indigenous ways of 
knowing and being across all research stages 
and disciplines. That researchers must rec- 
ognize and engage with Indigenous ways 
of knowing does not necessarily mean it 
must then replace other ways of knowing. 
For example, the concept of Etuaptmumk 
(M’kmaw for “Two-Eyed Seeing”) provides 
a conceptual framework for embracing mul- 
tiple perspectives within a system (8). 

Researchers should be guided by UNDRIP 
in creating space for equitable relation- 
ships and places of sharing and learning. 
Researchers can create opportunities for 
Indigenous peoples to freely determine their 
own research agendas by co-creating re- 
search objectives, questions, and approaches. 
In relationship with their Indigenous com- 
munity partners, researchers can bring to 
the forefront Indigenous epistemologies 
and Indigenous-led approaches that sup- 
port Indigenous self-determination. For 
Indigenous peoples, these steps can help 
with the continued survival of their cultures, 
languages, and lands and ideally can support 
Indigenous sovereignty (9, 10). 


Free, prior, and informed consent 

UNDRIP states that Indigenous people 
must give “free and informed consent prior 
to the approval of any project affecting their 
lands or territories and other resources” 
(Article 32.2), and “before adopting and 
implementing [...] measures that may affect 
them” (Article 19). Researchers must ensure 
that free, prior, and informed consent is ob- 
tained for all stages of the research process, 
as is necessary for Indigenous self-determi- 
nation. This goes beyond informed consent 
of individual research participants—which 
is required for all human subjects—to in- 
clude ongoing consent to conduct research 
in Indigenous territories. Ensuring con- 
sent requires that researchers accept that 
Indigenous peoples may not be interested 
in the proposed research, i.e., researchers 
must be willing to accept “no” as an answer. 
Additionally, priorities may shift over the 
course of a project; informed consent must 
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be ongoing and revisited. If consent is with- 
drawn during the course of a research proj- 
ect, and the research cannot be reshaped 
to continue collaboration, researchers need 
to communicate with relevant parties (e.g., 
funders, other collaborators) to identify 
next steps. Ensuring consent, and prevent- 
ing harms, require careful consideration of 
how data collection, new knowledge devel- 
oped, and outputs generated could infringe 
on Indigenous peoples’ rights. It also re- 
quires understanding the Indigenous con- 
text, including historical experiences and 
desired futures. Repositioning Indigenous 
priorities within research can build ben- 
eficial and lasting relationships, address 
Indigenous needs, strengthen reconcilia- 
tion, and improve research. 


tural heritage, traditional knowledge, and 
traditional cultural expressions” (Article 
31.1). Also emphasized is redress for “cul- 
tural, intellectual, religious and spiritual 
property taken without their free, prior and 
informed consent or in violation of their 
laws, traditions, and customs” (Article 11.2). 

Indigenous data sovereignty conveys the 
right of an Indigenous group to reclaim and 
govern the use of its own intellectual prop- 
erty, including data collection, organization, 
and storage (17). The importance of data 
sovereignty is further affirmed by recent de- 
velopment of Indigenous data governance 
frameworks [e.g., (77)] and tools, which in- 
clude procedures that researchers must adopt 
to increase transparency, protect Indigenous 
intellectual property, and align with specific 


Luis Levil, a Huilliche-Chilote fisher, and PhD student Jaime Ojeda feed hake offal to seabirds in southern 
Patagonia, Chile, as part of a research project about the hake fishery. 


Engaging with the highest level of ethical 
standards, and analyzing risk, are important 
to identify potential research impacts at the 
outset. Many Indigenous communities have 
created their own ethics guidance to create 
equitable research processes [e.g., (J0)], in- 
cluding how to obtain their free, prior, and 
informed consent. Indigenous peoples’ guid- 
ance should be followed where research is 
undertaken in their respective territories. 


Intellectual property 

Ensuring Indigenous peoples control their 
own intellectual property is reflected in 
several Articles of UNDRIP. For example: 
“Indigenous peoples have the right to main- 
tain, control, protect and develop their cul- 


Indigenous protocols (e.g., codeveloping 
agreements for research collaboration and 
data sharing). Researchers have a respon- 
sibility to protect Indigenous data because 
governing their own intellectual property is 
fundamental to self-determination (11). 


Engagement and learning 

UNDRIP states that Indigenous peoples have 
a right to establish educational systems “in a 
manner appropriate to their cultural meth- 
ods of teaching and learning” (Article 14.1); 
to access “all levels and forms of education 
of the State without discrimination” (Article 
14.2); to transmit knowledge outside of edu- 
cational institutions and to “retain their own 
names for communities, places and persons” 
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(Article 13.1); to be “appropriately reflected in 
education and public information” (Article 
15.1); and to “understand and be understood” 
in policymaking (Article 13.2) and in media 
(Articles 16.1 and 16.2). 

Researchers in all disciplines must re- 
spect Indigenous methodologies and _ rec- 
ognize the knowledge that already exists in 
areas where they study. Researchers have a 
responsibility to share findings with partici- 
pants so they can be transmitted to future 
generations and should do so in an under- 
standable format and without institutional 
barriers. Indigenous knowledge and perspec- 
tives should be appropriately represented 
in the work researchers share with their 
peers or the public. Indigenous research- 
ers bear considerable responsibilities of 
representing Indigenous voices in research. 
Non-Indigenous researchers can support 
Indigenous peoples by improving capacity 
for collaboration and innovation to support 
self-determination. 

Good practice in all community-engaged 
research—which is essential when working 
in Indigenous territories—is to share proj- 
ect updates and results frequently with the 
Indigenous communities with whom they 
collaborate; publish results in open-source 
journals and nonacademic formats whenever 
possible and where permission is granted; ac- 
knowledge relevant Indigenous community, 
history, knowledge, and place names; and 
support Indigenous colleagues and emerging 
scholars through supporting grant writing, 
hiring Indigenous research assistants, and 
creating mentorship opportunities. 


MOVING FORWARD 
Researchers have a responsibility to uphold 
Indigenous rights by reflecting on the articles 
of UNDRIP as they relate to their research. 
Researchers are well-positioned to lead by 
example not only in their own work, but 
also in influencing the related institutions 
and funders that support research. Working 
with and for Indigenous peoples in a good 
way requires understanding the effects of 
colonial histories on Indigenous peoples and 
the contribution of research and research in- 
stitutions to past and ongoing colonial lega- 
cies, as well as the systemic barriers that con- 
tinue to exist that disadvantage Indigenous 
peoples, recognizing existing capacities and 
strengths, and advocating for change. 
Researchers control and shape how they 
carry out their own work and hence there 
is no excuse for not upholding Indigenous 
rights. To uphold UNDRIP, researchers must 
adopt a transdisciplinary research approach 
that engages Indigenous methodologies, 
where appropriate, instead of prioritizing 
Western research framing (12). Indigenous 
methodologies are based on the worldview, 
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values, and epistemology of Indigenous 
peoples, and hence are diverse. Often, they 
challenge dominant Western methodologies 
that marginalize, misrepresent, or silence 
Indigenous voices and perspectives. The 
potential misalignment between the two 
arises from fundamental differences in epis- 
temologies, ontologies, and research meth- 
odologies. Frictions might occur as a result 
of siloed mindsets and Western methodolo- 
gies that prioritize supposed objectivity and 
quantitative data over Indigenous ways of 
knowing, or when Indigenous frameworks 
challenge Western notions of ownership, 
consent, or data sovereignty. By rebalancing 
research dynamics and shifting power struc- 
tures, knowledge coproduction becomes the 
standard, creating and maintaining ethical 
space (9). It takes time to conduct research 
respectfully through meaningful dialogue 
and relationships. Much guidance exists on 
partnering with Indigenous peoples (7-13), 
and researchers need to do their own learn- 
ing and reflecting prior to engagement. 
Researchers can play an important role 
in influencing institutions and education 
spaces (e.g., universities, government re- 
search institutes, funding agencies) to embed 
Indigenous self-determination in research 
practices (6) by compelling institutions to 
develop processes that ensure UNDRIP is 
followed. For example, much like requiring 
human and animal ethics applications, and 
permission to access private or state-owned 
lands, institutions must mandate research- 
ers to follow protocols, conditions, and per- 
missions set and required by the Indigenous 
peoples on whose territory (whether State- 
recognized or not) they wish to conduct re- 
search, and to only proceed with research 
when permissions are granted. Institutions 
must ensure that processes and resources 
are available, especially for Indigenous 
peoples to build and maintain the capacity 
to respond to research requests and partici- 
pate as equal or leading research partners. 
Furthermore, researchers have the power to 
influence funders in adapting funding mech- 
anisms that enable Indigenous peoples to 
lead research in their territories. Leadership 
is especially needed from senior researchers 
and disciplinary leaders to transform the per- 
formance measures and cultures dominant 
in many research institutions that hinder the 
development of research practices consistent 
with UNDRIP. This should not be a burden 
imposed solely on young researchers and 
Indigenous scholars and communities. 
Implementing UNDRIP through research- 
ers’ spheres of control and influence will set 
the stage for supporting broader social, eco- 
nomic, and environmental desirable changes 
within their sphere of interest. Researchers 
can affect systemic change by support- 


ing Indigenous sovereignty—for example, 
through sustained research partnerships 
and following Indigenous-led approaches. As 
such, researchers can contribute to broader 
policy changes needed to ensure that the in- 
herent rights of Indigenous peoples are rec- 
ognized in all aspects of society. Examples 
range from framing fisheries management 
with Indigenous lenses (13) to the Ecuadorian 
and Bolivian constitutions being based on 
Indigenous epistemologies (14). 

It takes long-term commitment and cre- 
ativity to move beyond the tried and tired re- 
search practices common among today’s co- 
lonial research paradigms (12, 15). UNDRIP 
is a good starting point, but we also recog- 
nize that serious concessions continue to 
be required by Indigenous peoples within 
UNDRIP. For instance, although UNDRIP 
recognizes Indigenous self-determination, it 
does not go so far as to support Indigenous 
sovereignty. Indigenous ontological and epis- 
temological frameworks offer different ap- 
proaches in reconsidering more innovative 
research learning opportunities to guide our 
shared future. The research community has 
an opportunity and responsibility to work 
more closely with Indigenous peoples to cre- 
ate a more equitable and sustainable future 
based on the rights and concepts within 
UNDRIP, and beyond. 
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TECHNOLOGY POLICY 


Delivering democracy 


Better digital tools can improve government services 


and strengthen nations 


By Camille Crittenden 


n Recoding America, former US deputy 
chief technology officer for govern- 
ment innovation Jennifer Pahlka offers 
a remarkably compelling, inspiring, and 
entertaining read about government 
technology. Her behind-the-scenes look 
at entrenched US systems of government 
service delivery illustrates how she and her 
teams have overcome bureaucratic sludge to 
improve lives. As the founder and former ex- 
ecutive director of Code for Amer- 
ica, a nonprofit organization that 
aims to strengthen democracy by 
advancing digital tools for govern- 
ment service, she is well qualified 
to address these topics. 
Recoding America is as much 
about clashing cultures as it is 
about software development 


Recoding America: 


those convicted of certain marijuana-related 
felonies to petition for their records to be 
sealed. Sweeping in its intent, it left the bur- 
den of petitioning on those who had been 
convicted. Pahlka describes how she and her 
team at Code for America worked with the 
San Francisco district attorney to develop 
tools to automate record expungement, re- 
sulting in more than 8000 records being 
cleared in San Francisco alone. 

Automating the implementation of 
other, similar policies has proven more dif- 
ficult. California’s Proposition 
47, for example, reduced felony 
convictions to misdemeanors for 
those convicted of commercial 
burglary in amounts of $950 or 
less. However, there is no specific 
legal code that captures commer- 
cial burglaries or the estimated 
value of stolen property. Finding 


methodology and technology plat- Why Government redress for those convicted is a 

forms. Pahlka describes the his- Is Failing in the laborious, time-consuming pro- 

torical hierarchy between policy- _ DigitalAgeandHow _ cess that is resistant to automa- 

makers (“big picture thinkers”) We Can Do Better tion. The result is that thousands 
9 . a Jennifer Pahlka ‘4 

and IT teams (“mechanicals”) and Metropolitan Books of low-level offenders will have 

offers evidence of how closer col- 2023. 336 pp. difficulty finding housing or em- 


laboration between these groups 
would help lawmakers achieve the policy 
outcomes they seek. 

For instance, when California’s Propo- 
sition 64 came into effect in 2016, it allowed 
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ployment because of convictions 
that should have been erased. 

Government IT teams must often work 
with outdated legacy hardware, software, 
and data storage systems. Implementation 
timelines for tech upgrades lag those in the 
private sector, and the process can take 12 
to 18 months or longer. Technology capabili- 
ties can change drastically over this period, 
meaning that even before a technical devel- 


a. 


Aclient waits for the Affordable Care Act website upde 


to come back online on 31 March 2014. 


opment is undertaken, it may be eclipsed. 

Pahlka advocates that government en- 
tities adopt the more current “agile” ap- 
proach, which involves multiple rounds of 
testing and iteration before a final product 
is delivered, and she shows how a lack of 
tech agility can cause real-world hardship. 
A harrowing example of such a shortfall can 
be found in the stories of families separated 
at the border under the Trump adminis- 
tration. The computer registration system 
used by US Border Patrol officials provided 
no way to associate ID numbers assigned to 
unaccompanied minors with those of adult 
family members. To update the database 
architecture and software would have re- 
quired a new competitive procurement pro- 
cess, which could take 3 to 5 years. Officials . 
resorted to placing sticky notes with iden- 
tifying information on infants. It would be 
months before many of the children were 
reunited with their parents. 

Throughout the book, Pahlka maintains 
a consistent clarity of purpose, focusing 
on how updating a given digital process 
can improve service delivery and thus 
strengthen democracy. Reducing adminis- 
trative burden (liked by the Left) and re- 
ducing regulatory burden (favored by the 
Right) could, she believes, unite lawmakers 
across the political spectrum. 

Pahlka shares hard-won lessons for how 
to streamline workflows and simplify pro- 
cesses for end users and agencies alike. 
Gather only the information necessary for a 
given purpose, she advises. When applying 
for food assistance, for example, individuals 
need not be asked whether they own a burial 
plot (as they currently are in many states, as 
an indicator of the applicant’s assets). And 
design for mobile first. Cell phones are the 
most frequent interface used by low-income 
residents. “Paperwork favors the powerful,” + 
she writes, reminding readers that those eli- 
gible for government benefits may not have 
the time and ability to complete arduous 
documentation. Reducing paperwork will 
thus enable families with fewer resources to 
attain the assistance they need. 

Pahlka’s view of government operations 
is expansive, and I would have been inter- 
ested to learn more about her perspective 
on other timely topics, including the no- 
table presence of female leaders in civic 
technology and the opportunities and chal- 
lenges presented by generative artificial in- 
telligence for government services. I highly 
recommend this valuable and enlightening 
book to anyone in public-sector IT, as well 
as to policy-makers and their staff. @ 

10.1126/science.adi5581 
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MARINE ECOLOGY 


The ocean's disappearing giants 


A journalist documents the struggle to catch, tag, 
and restore populations of Atlantic bluefin tuna 


By Olaf P. Jensen 


e observe all animals through the 
lens of our human perspective, 
but some are especially obscured 
by the clouded glass of symbol- 
ism and story. So it is for the 
bluefin tuna, a species overfished 
to the degree that it has become a living 
shorthand for man’s wastefulness and 
greed. Few of us have seen this fish swim- 
ming free in its natural environment; they 
are rare even in aquaria. Those who see it 
at all see its death—the staccato of its cres- 
cent moon tailfin beating a ship’s 
deck, power without purchase. 

In Kings of Their Own Ocean, 
Karen Pinchin tells the stories 
of people whose lives intersect 
with Atlantic bluefin tuna, one 
of three bluefin tuna species 
found in the world’s oceans. 
Most prominently, there is Al 
Anderson, a salty charter captain 
from Rhode Island. Al preaches 
the gospel of catch-tag-and-re- 
lease and makes converts of his 
fishing clients, many of whom 
come aboard expecting to bring 
home sashimi but instead return 
to the dock with a tagging card 
full of data about the fish they 
released. There are also fascinat- 
ing appearances by the Reverend 
Sun Myung Moon and his follow- 
ers, who took over bluefin tuna 
fishing and buying in Gloucester, 
Massachusetts, in the 1970s; 
Spanish trap fishermen who scuba dive 
with penned tuna to herd them deeper 
into the traps; and tuna researcher Molly 
Lutcavage, a pioneer in the use of elec- 
tronic tags to track movements of large 
pelagic fish. 

The thread that ties these stories to- 
gether is Amelia, a one-in-a-million fish that 
was tagged and recaptured multiple times 
during her 15- or 16-year life as she traveled 
from the US Northeast to a Spanish fish 
trap in the Mediterranean, where she met 
her end. Pinchin conveys the excitement of 
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recapturing a tagged fish and the inevitable 
wonder about its life since release. 

I still, after having received hundreds 
of returned fish tags, rush to my laptop to 
look up tag numbers and find out when 
and where a fish was tagged, how much 
it grew, and, the unanswerable, what did 
it experience while it was “at large”? In 
Amelia’s case, that experience is known in 
more geographic detail, as she carried an 
electronic tag that tracked her movements 
for several months. 

The people in Kings of Their Own Ocean 
are portrayed in all their rich human com- 


Bluefin tuna are hauled into a boat near the Barbate coast in Spain. 


plexity, a quality that sets this book apart 
from many of the other nonfiction works 
focused on a single fish species that fol- 
lowed Mark Kurlansky’s 1997 international 
bestseller, Cod: A Biography of the Fish 
That Changed the World. However, Pinchin 
presents the overfishing of bluefin tuna 
mostly as a simple tale of greed. The “bad 
guys” (Japanese fishery management del- 
egations and commercial fishers) and the 
“good guys” (anglers who practice catch- 
and-release and conservationists raising 
the alarm) might as well wear black and 
white hats. Readers are introduced to a 
couple more obscure villains, including the 
concept of maximum sustainable yield and 
the partitioning of Atlantic bluefin tuna 


INSIG] A 


Kings of Their Own Ocean: 
Tuna, Obsession, 

and the Future of Our Seas 
Karen Pinchin 

Dutton, 2023. 320 pp. 


into two geographic stocks, both of which 
Pinchin blames for driving overfishing, but 
this part of the book is mostly well-trodden 
ground. If there is something new here, it 
is the meta-story of how conservationist 
and author Carl Safina managed to equate 
bluefin tuna with human greed in the 
public mind and thus launch a consumer- 
driven marine conservation movement. 

But fisheries are more complex than 
this—which is what makes them so inter- 
esting to study. The sustainability of tuna . 
fisheries is driven not only by the decisions 
of the fishermen but also by biology, eco- 
nomics, and the institutions that 
attempt to manage the whole 
social-ecological system. 

Tuna fishing is expensive, 
and the biological and economic 
equilibrium point is a function 
of costs, revenue, and the ability 
of the species to withstand har- 
vest (J). Through well-meaning 
but misguided fuel subsidies for 
fishermen, many national govern- 
ments have driven this “bionomic 
equilibrium” toward more fishing 
and fewer tuna left in the sea. The 
upshot is that across species and 
oceans, the conservation status of 
tuna can be reasonably well pre- 
dicted from two variables: their 
generation time and market price 
(2). The valuable and relatively 
long-lived Atlantic bluefin tuna 
suffers on both counts. 

Yet all is not lost. The most re- * 
cent assessment of the Western 
Atlantic stock (3) shows the population 
recovering in a long, slow arc starting 
around 2007 after fishing mortality rates 
were finally brought down to consistently 
lower levels. While it is too soon to pop the 
champagne, Pinchin ends on a note of well- 
founded optimism that bluefin tuna man- 
agement might finally be turning a corner 
on the era of unrestrained overfishing. 
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LETTERS 


Edited by Jennifer Sills 


Undue publicity for 
flawed fraud detector 


In the News In Depth story “Fake scientific 
papers are alarmingly common” (9 May, 
https://scim.ag/2KD), J. Brainard describes 
a medRxiv preprint by Sabel et al. (1) 
alleging that nearly 30% of all scientific 
articles may be the fake products of paper 
mills. Fraudulent papers are indeed a 
concern, but this story sensationalizes the 
study’s findings while downplaying its 
substantial flaws. In doing so, Brainard 
unjustly tarnishes the scientific enter- 
prise and causes undeserved harm to the 
reputation of scientists from the coun- 
tries singled out in the preprint. 

Sabel et al. propose a classifier for 
identifying fake papers. Rather than 
training this classifier using machine 
learning, the researchers preselected just 
three features to flag suspected fakes: 
whether the authors use private email 
addresses, whether they are affiliated 
with a hospital, and—unmentioned by 
the News story—whether the team lacks 
international collaborators. Brainard 
eventually cites details from the preprint 
showing that the algorithm misclassified 
more than a third of real papers as fake. 
This unacceptable false-positive rate indi- 
cates that the method has failed. 

The structure of the Sabel ez al. study 
demonstrates how such classifiers can 
be used to reinforce discrimination and 
inequity. Because papers without inter- 
national coauthors, as well as authors 
without institutional email addresses, are 


SCIENCE science.org 


deemed untrustworthy, the classifier is 
more likely to identify papers from Asia 
and the Global South, where interna- 
tional collaborations are less common (2) 
and personal email addresses are more 
frequently used (3). Yet Sabel and col- 
leagues use the output of this classifier to 
argue that fraud is more common among 
authors from these countries, without 
controlling for these well-documented 
international differences. Such circular 
reasoning unjustly disparages authors 
from these countries. 

The algorithm that Science has heralded 
is fundamentally biased in its imple- 
mentation, with potentially racist conse- 
quences. Using a highly biased algorithm 
even as a preliminary screen is unaccept- 
able without carefully considering issues 
of fairness and algorithmic harm that 
arise from its use. No such consideration 
is given, and the News story gives only 
passing mention to the risks. Science 
should be on the vanguard of drawing 
attention to how machine intelligence can 
encode racism while perpetuating and 
exacerbating traditional inequities, not an 
accessory to such wrongs. 
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Advancing equity and 
integrity in research 


In the News In Depth story “Fake scientific 
papers are alarmingly common” (9 May, 
https://scim.ag/2KD), J. Brainard draws 
attention to a number of new tools and 
technologies that researchers and pub- 
lishers are developing in an effort to 
increase the integrity of research. These 
tools primarily work by automating the 
identification of fake authors and fake 
papers with the use of a set of variables 
common in publications, such as author 
email address and affiliation. The goals 
of these technologies are valid and 
important, but the scientific community 
needs to act cautiously when determin- 
ing which variables are used to identify 
and flag individuals and manuscripts. 

A private email address could be con- 
sidered a flag for further scrutinizing a 
publication, but private email addresses 
are often used by legitimate researchers, 
many of whom are nonnative English 
speakers or use non-Latin writing 
scripts. In countries such as India and 
China, more researchers use personal 
email addresses than institutional ones 
(1). Judging the legitimacy of research 
based on this flag coupled with a hospi- 
tal affiliation, as one medRxiv preprint 
Brainard cites does (2), without any 
follow-up practice that accounts for lan- 
guage or cultural differences, would be 
a disservice to scientific advancement 
globally. 

More rigorous author and manuscript 
evaluation needs to verify individu- 
als and institutions by leveraging not 
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just email domain and affiliations but 
also author disambiguation tools and 
persistent identifier registries, such 

as Open Researcher and Contributor 
Identifiers (ORCIDs) and the Research 
Organization Registry. An additional 
analysis of publishing history—including 
the number of publications, coauthor- 
ship, and a network analysis—should be 
applied to create signals for additional 
checks on the author of a manuscript. 
These results should then be bal- 

anced with knowledge and sensitivity. 
Graduate students and new faculty with 
a light publication history are not fake 
authors, just new to the field. 

Ensuring research integrity requires 
reducing inequities and gatekeeping 
in the publishing ecosystem. Some of 
the tools cited by Brainard use limited 
checks that prematurely categorize and 
ultimately miscategorize researchers as 
nefarious actors. Both the proposed tool 
and the choice by Science to promote it 
contribute to furthering inequities rather 
than improving trust in science. 

The Committee on Publication Ethics 
(COPE) has developed sensible guidance 
(3) for editors and publishers on how to 
navigate any potential paper-mill case 
once it has been identified. Better evalua- 
tion mechanisms will allow the scientific 
community to simultaneously advance 
equity and integrity goals. 
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Research Libraries, Washington, DC, USA. 
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Editor’s note 


We thank Bergstrom and Ogbunugafor, 
as well as McIntosh and Hudson Vitale, 
for pointing out the limitations of tools 
for detecting fake papers such as the one 
developed by Sabel et al. Our story states 
prominently, in the fourth paragraph, 
that Sabel et al’s method “isn’t a perfect 
solution, because of a high false-positive 
rate.” Later we specify the high false- 
positive numbers. Far from heralding or 
sensationalizing the tool, we presented it 
as a rough indicator of a real problem. 
Tim Appenzeller 
News Editor, Science 

10.1126/science.adj3681 
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TRPM7 phosphorylates amyloid away 


he abundance of the dual-function ion channel and kinase TRPM7 


is decreased in postmortem brain samples from patients with 
Alzheimer's disease. Zhang et al. uncovered a pathological role 
for loss of TRPM7 in Alzheimer’s disease. In a mouse model of 


amyloid-B—induced pathology, synapse formation and cognitive 


function were restored in aged mice and preserved in presymptom- 


atic, younger mice upon overexpression of the kinase portion of TRPM7 
called M7CK. M7CK phosphorylated and activated the protease MMP14, 


which promoted amyloid-B degradation and clearance. —LKF 


Sci. Signal. (2023) 10.1126/scisignal.ade6325 


In mice, overexpression of the kinase M7CK 


helps to stimulate the clearance of 


amyloid plaques, shown in this illustration 


as yellow clumps around neurons. 


QUANTUM OPTICS 
Controlling quantum 
randomness 


The state of quantum mechani- 
cal systems fluctuates. In 

optics, these fluctuations can 

be harnessed for the genera- 
tion of truly random bit strings. 
The random fluctuations can 
also be used in probabilistic 
computing, but this requires the 
probability distributions to be 
controllable. Using the two-state 
phase output from an optical 
parametric amplifier as a “bit,” 
Roques-Carmes et a/. showed 
that the bit probabilities changed 
in response to a small bias field 
injected into the system. By 
changing the attenuation level of 
the bias field, the authors were 
able to traverse the continu- 

ous space between perfectly 
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random and deterministic 
state selection. The approach 
provides the prospect of using 
vacuum fluctuations as a source 
of controllable randomness for 
photonic probabilistic comput- 
ing. —ISO 

Science, adh4920, this issue p. 205 


SOLAR CELLS 
Less penetrating but 


more stable 


The thermal stability of three- 
dimensional perovskite solar 
cells can be improved by adding 
ammonium ligands that create 
a two-dimensional perovskite 
capping layers, but such ligands 
are prone to intercalation into 
the bulk. Park et al. showed that 
the smallest aromatic ligand, 
anilinium, had the lowest ligand 


reactivity with three-dimensional 
perovskites because of steric 
hindrance near the ammonium 
group, and a fluorinated 
derivative of this ligand created 
a robust interface structure. 
Encapsulated solar cells 
maintained 85% of their power 
conversion efficiency of about 
20% at 85°C and 50% relative 
humidity after about 1600 hours 
of maximum power point opera- 
tion. —PDS 

Science, adi4107, this issue p. 209 


MOLECULAR BIOLOGY 
Corepressors and 
enhancers form partners 


The regulation of gene expres- 
sion involves different signals to 
switch transcription on or off. 
Gene activation has received 


widespread attention, but 
fewer studies have looked at 
how repressive transcription 
factors and their partners 
known as corepressors silence 
transcription. Jacobs et al. now 
show that not all corepressors 
can silence all gene-activating 
regions, called enhancers. 
Instead, corepressors work 
selectively, depending on 
specific enhancer charac- 
teristics such as chromatin 
and sequence patterns, and 
altering these motifs can make 
enhancers more or less sensi- 
tive to specific corepressors. 
This finding uncovers another 
layer of gene expression con- 
trol and highlights the interplay 
between distinct mechanisms 
of transcriptional activation 
and repression. —DJ 

Science, adf6149, this issue p.198 
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CHATGPT 
Generative Al as a 
writing aid 
Automation has historically 
displaced human workers in 
factories (e.g., automotive 
manufacturing) or in perform- 
ing routine computational 
tasks. Will generative artificial 
intelligence (Al) tools such 
as ChatGPT disrupt the labor 
market by making educated 
professionals obsolete, or will 
these tools complement their 
skills and enhance productiv- 
ity? Noy and Zhang examined 
this issue in an experiment that 
recruited college-educated 
professionals to complete 
incentivized writing tasks. 
Participants assigned to use 
ChatGPT were more produc- 
tive, efficient, and enjoyed the 
tasks more. Participants with 
weaker skills benefited the 
most from ChatGPT, which 
carries policy implications for 
efforts to reduce productivity 
inequality through Al. —EEU 
Science, adh2586, this issue p. 181 


TUMOR IMMUNOLOGY 
Instructing tumor cell 
turncoats 


Cancer cells down-regulate 
antigen presentation to evade 
recognition by the immune 
system. Zimmermannova et 

al. reprogrammed cancer cells 
into tumor-derived antigen- 
presenting cells (tumor-APCs). 
Mouse or primary human 
cancer cells transduced with 

a set of transcription factors 
acquired a stable dendritic 

cell phenotype, could present 
endogenous tumor antigens 
to stimulate the effector 
function of CD8* T cells, and 
showed decreased tumori- 
genicity. In mouse models, 
intratumoral injection of 
tumor-APCs decreased tumor 
growth, increased the sur- 
vival of tumor-bearing mice, 
and enhanced responses to 
immune checkpoint inhibi- 
tors. This study highlights that 
future immunotherapies aimed 
at reprogramming cancer cells 
in situ could combine reversing 
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malignancy with mobilizing 
the cellular antigen-presenting 
machinery to enhance the 
activity of CD8* T cells toward 
tumors. —SHR 
Sci. Immunol. (2023) 
10.1126/sciimmunol.add4817 


QUANTUM NETWORKS 
Building a multichip 
quantum network 


The realization of a scal- 
able quantum entanglement 
network relies on the develop- 
ment of a scalable architecture 
and on incorporating hardware 
components that are mass 
manufacturable. Zheng et al. 
proposed and demonstrated 
a scalable architecture using 
photonic hybrid multiplex- 
ing. They realized a toolbox of 
integrated nanophotonic hybrid 
multiplexing devices that 
can be manufactured using 
standard industrial fabrication 
processes. Their chip-based 
quantum entanglement 
network could find important 
applications in quantum com- 
munications, metrology, and 
distributed quantum comput- 
ing. —ISO 
Science, adg9210, this issue p. 221 


SUPERCONDUCTIVITY 
What sets the transition 
temperature? 


The mechanism of high-tem- 
perature superconductivity in 
copper oxide materials remains 
a mystery more than 30 years 
after its initial discovery. One 
way to shed light on this is to 
look for correlations between 
different observables in cuprate 
families. Wang et al. used elec- 
tron energy-loss spectrometry 
to measure the charge transfer 
gap in aclass of bismuth-based 
cuprates with varying num- 
bers of copper oxide planes 
in the unit cell. The measured 
gap was negatively correlated 
with the maximum transition 
temperature, providing a design 
principle for further increasing 
the working temperature of 
cuprate superconductors. —JS 
Science, add3672, this issue p. 227 


IN OTHER JOURNALS 


Spotted-winged fruit fly 
in flight over a strawberry, 
one of the many fruit 
species that this insect’s 
larvae feeds on 


EVOLUTION 
A hearty eater 


Edited by Caroline Ash 
and Jesse Smith 


any plant-eating insects are picky eaters and special- 

ize on a single host plant. Some species do have 

generalist appetites, but whether they use the same 

biochemical processes as specialists to convert 

different foods into energy and other bioactive 
molecules is not well understood. Olazcuaga et al. fed a gen- 
eralist species, the spotted wing fruit fly, on blackcurrant, 
cherry, cranberry, and strawberry separately and then mea- 
sured the chemical compositions of the fruits and the flies. 
The authors found that the same metabolism was used by 
flies to convert fruit into energy regardless of which species 
they ate. Consequently, some harmful chemicals unique to 
a fruit were not detoxified and remained in the body. There 
may thus be a negative tradeoff for the advantages of being 
a generalist. —DJ eLife (2023) 10.7554/eLife.84370 


NEUROSCIENCE 
Unique development of 
the insular cortex 


During fetal development, the 
human brain expands to form 
cortical lobes that fold into gyri 


(ridges) and sulci (grooves). 
Models of brain folding can 
explain the genesis of individual 
gyri and sulci. However, these 
models exclude the formation 
of the insula, a region deep 
inside the cortex involved in 
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PLANT SCIENCE 
Metabolic enzymes 
moonlighting on histones 


The methylation state of 
histones affects chromatin 
structure and gene expression. 
Histone demethylase activity 
influences the distribution of 
these regulatory methylation 
marks. Huang et al. found that an 
enzyme called a-ketoglutarate 
dehydrogenase (KGDH) binds 
to and inhibits the activity of 
histone demethylases (see the 
Perspective by Taylor). KGDH 
is normally associated with the 
respiratory tricarboxylic acid 
cycle in mitochondria, but its 
substrate, a-ketoglutarate, is also 
required for histone demethy- 
lase activity. The KGDH enzyme 
enters the nucleus of Arabidopsis 
cells and can interact directly 
with both histone demethylases 
and chromatin. As a result, both 
histone demethylation and gene 
expression are altered by KGDH, 
providing an alternative role 
for KGDH beyond respiratory 
metabolism. —MRS 

Science, adf8822, this issue p.179; 

see also adi7577, p.125 


ANTIMICROBIAL PROTEINS 


Lysis by phage protein E 
With the progressive loss of 
effective antibiotics against 
many bacterial pathogens, phage 
therapy has attracted renewed 
interest as a means of treating 
infections. Some phages gener- 
ate antimicrobial proteins that 
result in lysis of their host cells. 
Orta et al. determined structures 
of the “YES” complex, consist- 
ing of the phage protein E and 
two proteins involved in cell wall 
biosynthesis, MraY and SlyD (see 
the Perspective by Balbach and 
Stubbs). Protein E blocks a sub- 
strate-binding site in MraY, thus 
blocking its essential enzyme 
activity. A better understanding 
of this mechanism may be useful 
in developing effective phage 
therapeutics. —MAF 

Science, adg9091, this issue p. 180; 

see also adi7571, p.126 
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TOPOLOGICAL MATTER 
Capturing the effects of 
the quantum metric 


The quantum metric, which 
quantifies the distance between 
neighboring wavefunctions in a 
quantum system, has been pre- 
dicted to affect many physical 
responses. One example is the 
predicted nonlinear anomalous 
Hall effect in antiferromagnets 
with particular symmetries. Gao 
et al. observed this effect ina 
structure consisting of an even 
number of layers of the material 
MnBi,Te, interfaced with black 
phosphorus. The researchers 
reached their conclusions by 
carefully ruling out alternative 
interpretations. The results 
establish the system as a viable 
setting for studying the effects 
of the quantum metric. —JS 
Science, adf1506, this issue p. 170 


QUANTUM OPTICS 
Quantum nonlinear optics 


In general, photons do not 
interact much with each other, 
requiring a nonlinear medium 
to enhance and induce an 
interaction. Such an interaction 
typically requires a large number 
of photons to change the refrac- 
tive index of the medium. Drori et 
al. used an optically compressed 
and trapped sample of Rydberg 
atoms to induce an extreme pho- 
ton-photon interaction regime in 
which quantum nonlinear effects 
can be seen. The formation of 
two- and three-photon vortices 
was demonstrated, providing 
a route to developing quantum 
nonlinear optics and multipho- 
ton quantum logic. —ISO 

Science, adh5315 this issue p. 187 


WOOD ENGINEERING 
CRISPR wood for 
sustainable fibers 


Trees provide an important 
natural resource, but breeding 
for optimal wood properties is 
time consuming and hindered by 
the complexity of tree genetics 
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and diversity. Sulis et al. show 
that CRISPR technologies can 
be readily deployed to enhance 
wood properties and augment 
the sustainability of forest trees 
(see the Perspective by Zuin 
Zeidler). The authors generated 
multiplexed genetic alterations 
modifying wood composition 
in poplar with more desirable 
traits for fiber pulping and lower 
carbon emissions. This work 
demonstrates that genome 
editing can be harnessed for 
breeding more efficient trees, 
which will provide timely oppor- 
tunities for sustainable forestry 
and a more efficient bioecon- 
omy. —DJ 

Science, add4514, this issue p. 216; 

see also adi8186, p. 124 


HEART DISEASE 


Open the door, HOMER! 
Atrial fibrillation (AFib), the most 
common type of heart arrhyth- 
mia, is a serious condition that 
can result in atrial blood clots 
and thromboembolic stroke. 
Hulsmans et al. performed sin- 
gle-cell RNA sequencing on atrial 
tissue from AFib patients and 
healthy controls to better under- 
stand how stromal and immune 
cells contribute to this disease. 
They found that recruited CCR2* 
SPP1* macrophages expanded 
in AFib patients. These cellular 
and transcriptomic changes 
were recapitulated in a mouse 
model of AFib that integrated 
hypertension, obesity, and mitral 
valve regurgitation (HOMER). 
Disrupting Cer2, which coordi- 
nates inflammatory macrophage 
recruitment to atria, or Spp1, 
which helps to drive inflam- 
matory fibroblast activation 
by macrophages, ameliorated 
disease burden in HOMER 
mice, suggesting two potential 
immunotherapy targets for Afib 
patients. —STS 

Science, abq3061, this issue p. 231 


LIVER FIBROSIS 
Macrophages in liver 
fibrosis and repair 


Macrophages are known to 
contribute to liver disease and 
repair, but strategies are needed 
to modulate their behavior to 
promote disease resolution. 
Moreno-Lanceta et al. showed 
that the E3 ubiquitin ligase 
RNF41 is down-regulated in 
macrophages during liver 
fibrogenesis in humans and 
mice. Macrophage-targeting 
nanoparticles bearing plasmids 
increased RNF41 expression 

in macrophages in mouse 
models of liver injury, resulting 
in improved hepatic fibrosis 
and regeneration. This work 
demonstrates the relevance 

of macrophage RNF41 in the 


F 


+ > C 
development and amelioration of 


chronic liver disease. —CAC 
Sci. Trans/. Med. (2023) 


10.1126/scitransimed.abq6225 
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CHATGPT 
Generative Al as a 
writing aid 
Automation has historically 
displaced human workers in 
factories (e.g., automotive 
manufacturing) or in perform- 
ing routine computational 
tasks. Will generative artificial 
intelligence (Al) tools such 
as ChatGPT disrupt the labor 
market by making educated 
professionals obsolete, or will 
these tools complement their 
skills and enhance productiv- 
ity? Noy and Zhang examined 
this issue in an experiment that 
recruited college-educated 
professionals to complete 
incentivized writing tasks. 
Participants assigned to use 
ChatGPT were more produc- 
tive, efficient, and enjoyed the 
tasks more. Participants with 
weaker skills benefited the 
most from ChatGPT, which 
carries policy implications for 
efforts to reduce productivity 
inequality through Al. —EEU 
Science, adh2586, this issue p. 181 


TUMOR IMMUNOLOGY 
Instructing tumor cell 
turncoats 


Cancer cells down-regulate 
antigen presentation to evade 
recognition by the immune 
system. Zimmermannova et 

al. reprogrammed cancer cells 
into tumor-derived antigen- 
presenting cells (tumor-APCs). 
Mouse or primary human 
cancer cells transduced with 

a set of transcription factors 
acquired a stable dendritic 

cell phenotype, could present 
endogenous tumor antigens 
to stimulate the effector 
function of CD8* T cells, and 
showed decreased tumori- 
genicity. In mouse models, 
intratumoral injection of 
tumor-APCs decreased tumor 
growth, increased the sur- 
vival of tumor-bearing mice, 
and enhanced responses to 
immune checkpoint inhibi- 
tors. This study highlights that 
future immunotherapies aimed 
at reprogramming cancer cells 
in situ could combine reversing 


SCIENCE science.org 


malignancy with mobilizing 
the cellular antigen-presenting 
machinery to enhance the 
activity of CD8* T cells toward 
tumors. —SHR 
Sci. Immunol. (2023) 
10.1126/sciimmunol.add4817 


QUANTUM NETWORKS 
Building a multichip 
quantum network 


The realization of a scal- 
able quantum entanglement 
network relies on the develop- 
ment of a scalable architecture 
and on incorporating hardware 
components that are mass 
manufacturable. Zheng et al. 
proposed and demonstrated 
a scalable architecture using 
photonic hybrid multiplex- 
ing. They realized a toolbox of 
integrated nanophotonic hybrid 
multiplexing devices that 
can be manufactured using 
standard industrial fabrication 
processes. Their chip-based 
quantum entanglement 
network could find important 
applications in quantum com- 
munications, metrology, and 
distributed quantum comput- 
ing. —ISO 
Science, adg9210, this issue p. 221 


SUPERCONDUCTIVITY 
What sets the transition 
temperature? 


The mechanism of high-tem- 
perature superconductivity in 
copper oxide materials remains 
a mystery more than 30 years 
after its initial discovery. One 
way to shed light on this is to 
look for correlations between 
different observables in cuprate 
families. Wang et al. used elec- 
tron energy-loss spectrometry 
to measure the charge transfer 
gap in aclass of bismuth-based 
cuprates with varying num- 
bers of copper oxide planes 
in the unit cell. The measured 
gap was negatively correlated 
with the maximum transition 
temperature, providing a design 
principle for further increasing 
the working temperature of 
cuprate superconductors. —JS 
Science, add3672, this issue p. 227 


IN OTHER JOURNALS 


Spotted-winged fruit fly 
in flight over a strawberry, 
one of the many fruit 
species that this insect’s 
larvae feeds on 


EVOLUTION 
A hearty eater 


Edited by Caroline Ash 
and Jesse Smith 


any plant-eating insects are picky eaters and special- 
ize on a single host plant. Some species do have 


generalist appetites, but whether they use the same 


biochemical processes as specialists to convert 

different foods into energy and other bioactive 
molecules is not well understood. Olazcuaga et al. fed a gen- 
eralist species, the spotted wing fruit fly, on blackcurrant, . 
cherry, cranberry, and strawberry separately and then mea- 
sured the chemical compositions of the fruits and the flies. 
The authors found that the same metabolism was used by 
flies to convert fruit into energy regardless of which species 
they ate. Consequently, some harmful chemicals unique to 
a fruit were not detoxified and remained in the body. There 
may thus be a negative tradeoff for the advantages of being 
a generalist. —DJ eLife (2023) 10.7554/eLife.84370 


NEUROSCIENCE 
Unique development of 
the insular cortex 


During fetal development, the 
human brain expands to form 
cortical lobes that fold into gyri 


(ridges) and sulci (grooves). 
Models of brain folding can 
explain the genesis of individual 
gyri and sulci. However, these 
models exclude the formation 
of the insula, a region deep 
inside the cortex involved in 
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Plastic waste (like that shown) can 
be burned but this'creates excessive 
emissions of greenhouse gases: 


integrating sensory signals 
(interoception). Using mag- 
netic resonance imaging and 
diffusion tractography, Mallela 
et al. observed that the insula 
has a different adult morphol- 
ogy and grows more slowly 
than other lobes and that these 
differences emerge early in 
development. In contrast to the 
other cortices, curved radial 
migratory streams of neural 
progenitor cells travel from a 
small area of the ventricular 
zone to reach the insular cor- 
tex. —PRS 

Proc. Natl. Acad. Sci. U. S.A. (2023) 

10.1073/pnas.2220200120 


POLITICS 


Gulf in voting 

Since the 1980s, there has 
been a significant gender gap in 
US voting. Compared with men, 
women disproportionately vote 
Democratic, which is even more 
pronounced among unmarried 
voters. England et al. examined 
representative data to under- 
stand the main drivers of the 
gender gap and found that it 
stems from racial composition 
and racial inequality. US female 
voters are disproportionately 
Black women because Black 
men are disproportionately 
disenfranchised, incarcerated, 
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or prematurely deceased, 
removing many from the elec- 
torate. Black women are also 
disproportionately unmarried. 
This discrepancy highlights the 
role of deeper social disparities 
in shaping US politics and elec- 
tion outcomes. —EEU 

Proc. Natl. Acad. Sci. U. S.A. (2023) 

10.1073/pnas.2221910120 


NEUROSCIENCE 
Alcohol alters action 
control 


Previous experiences are critical 
for decision-making throughout 
ife. Alcohol consumption has 
been shown to alter decisions, 
but a complete understand- 
ing of the nature of these 
alterations and the underly- 
ing mechanisms is lacking. 
Schreiner et al. showed that 
chronic alcohol administration 
in rodents impaired their abil- 
ity to use recent experiences 
to guide decision-making. 
Hyperactivation of the premo- 
tor cortex—-striatum circuit 
was associated with this 
impairment, and inhibiting the 
activity of this circuit restored 
experience-guided decision- 
making. Premotor regions 
might be potential targets to 
reduce alterations in executive 


PLASTIC WASTE 


transformation 


trality targets. —ECF 


functions associated with alco- 
hol use disorder. -MMa 

Cell Rep. (2023) 

10.1016/j.celrep.2023.112675 


SCIENCE EDUCATION 
A DEI requirement for 


chemistry majors 

The Amherst College chem- 
istry community aspires 

for their graduates to have 

a critical understanding of 

the implications of systemic 
discrimination as they pertain 
to chemical research and edu- 
cation. Chung et al. describe 
the formation and ongoing 
efforts of a collaborative 
partnership among students, 
faculty, and staff to identify 
avenues for the implementa- 
tion of inclusive change in 
their community. Specifically, 
the community implemented 

a diversity-equity-inclusion 
(DEI) requirement, “Chemistry 
in Society,” which engages 
students in historical and mod- 
ern-day challenges to equal 
participation and serves as 

the cornerstone for continuing 
conversations about structural 
inequality. Collectively, the stu- 
dent-staff-faculty collaboration, 
strengthened by intentional 
feedback structures, enabled 


Carbon-neutral plastic 


lastic pollution has become a worldwide problem 
that requires integral solutions. In the past few 
years, plastic transformation processes such as 
pyrolysis, gasification, and incineration, in which 
high-energy products can be exploited, have 
become common. Kwon et al. now show through model- 
ing that these processes emit large quantities of carbon 
dioxide and other greenhouse gases. They found that 
the emissions are larger than those of fossil fuel—based 
power systems per unit of power generated, which will 
become increasingly problematic as more plastic waste 
is used as a source of energy. The authors conclude that 
plastic transformation processes require effective carbon 
separation, capture, and storage technologies to reduce 
greenhouse gas emissions and comply with carbon neu- 


Energy Environ. Sci. (2023) 10.1039/D3EEQO969F 


inclusive departmental change 
and supported an ongoing feed- 
back cycle. —MMc 
J. Chem. Educ. (2023) 
10.1021/acs.jchemed.2c01190 


PRECISION AGRICULTURE 
Minimizing fertilizer 
waste 


Fertilizers and pesticides aid in 
maximizing agricultural yield, 
but it is preferable to admin- 
ister the minimums required 
without wastage that can 
become pollutants elsewhere 
in the ecosystem. Aguilar Pérez 
et al. showed that the frame- 
work material ZIF-8 can be 
used to encapsulate MiZax-3, a 
synthetic mimic of the metabo- 
lite zaxinone that is known to 
promote plant growth. Initial 
tests on tomato and pear! millet 
seedlings showed that encap- 
sulated MiZax-3 produced 
similar benefits on early-stage 
growth as Mizax-3 alone. 
Under field conditions, ZIF-8 
maintained stability of MiZax-3 
for 30 days. Capsicum plants 
showed enhanced yield, with an 
apparent synergistic effect from 
the ZIF-8 that may contribute 
zinc to aid plant growth. —MSL 
Nano Lett. (2023) 
10.1021/acs.nanolett.2c04506 
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PLANT SCIENCE 


Control of histone demethylation by nuclear-localized 
a-ketoglutarate dehydrogenase 


Fei Huang}, Xiao Luot, Yang Out, Zhaoxu Gaot, Qiming Tang, Zhenzhen Chu, 


Xinguang Zhu, Yuehui He* 


INTRODUCTION: Methylations on nucleosomal 
histones play fundamental roles in regulating 
eukaryotic transcription and other chromatin- 
based processes. Jumonji C domain-containing 
histone demethylases (JMJs) are found in most 
eukaryotes, including animals, plants, and 
fungi, and dynamically control histone methyl- 
ations. JMJs thus play a general role in regu- 
lating gene expression. JMJs are a-ketoglutarate 
(a-KG)/Fe(I]-dependent dioxygenases that 
use the cosubstrates a-KG and oxygen to re- 
move methyl groups from lysine residues by 
oxidation. 


RATIONALE: The histone demethylation activi- 
ty of certain JMJs can be regulated by sev- 
eral mechanisms in plants and/or animals, 
including posttranslational modifications of 
some JMJs and regulation by several JMJ- 
associated proteins, metabolites, and the co- 
substrate oxygen. It is currently unknown how 
nuclear JMJ activity may be regulated by other 
enzymes that metabolize a-KG. 


RESULTS: We constructed several loss-of- 
function Arabidopsis mutants in a-ketoglutarate 
dehydrogenase (KGDH), and found that KGDH 
not only played a primary role in the control of 
total cellular level of o-KG, TCA cycle flux, and 
cellular respiration, but also regulated various 
aspects of growth and development. Mutants 
with defective KGDH exhibited an accumula- 


Light promotes nuclear targeting of 
the TCA cycle-associated enzyme 
KGDH to regulate histone 
demethylation by JMJs in plants. Most 
KGDH subunits, including OGDHI, E2, 
and E3, are targeted to mitochondria 

to form a holoenzyme for a-KG 
decarboxylation in the TCA cycle. Light 
exposure promotes a small portion of 
KGDH entering into the nucleus. There, 
KGDH occupies thousands of loci and 
interacts with various JMJ histone 
demethylases to inhibit o-KG-dependent 
histone demethylations by JMJs. Inhibition 
of histone demethylation occurs by 
limiting the local availability of a-KG, and 
consequently controls gene expression. 


Huang et al., Science 381, 179 (2023) 14 July 2023 


tion of the potent antioxidants anthocyanins, 
a substantial delay in the developmental tran- 
sition to flowering, and a slowdown of shoot 
development. Next, using confocal fluorescent 
imaging, we found that a small portion of 
KGDH entered the nucleus. This process was 
regulated by exposure to light. We mutagen- 
ized a nuclear localization signal ina KGDH 
subunit, E2, and found that mitochondria- 
localized KGDH was unable to rescue the 
growth and development abnormalities in 
an e2 mutant. 

Biochemical measurements of various his- 
tone methylation marks revealed that global 
histone demethylation appeared to be inhibited 
by nuclear KGDH. We hypothesized that KGDH 
may couple with JMJs to regulate histone meth- 
ylations. Indeed, further protein-protein inter- 
action experiments revealed that nuclear KGDH 
directly interacted with various JMJs including 
demethylases of histone 3 lysine 4, histone 3 
lysine 9, and histone 3 lysine 27. 

We explored the function of nuclear KGDH 
in controlling genome-wide histone methyla- 
tion and gene expression in Arabidopsis using 
RNA sequencing and chromatin immuno- 
precipitation coupled with DNA sequencing. 
KGDH was found to occupy genomic regions 
at thousands of loci, correlating with histone 
methylation regions. Transcriptomic profiling 
revealed that thousands of genes, particularly 
environment-responsive genes, were misregulated 


6 


in mutants with defective KGDHs. Biocheny ee 
analysis uncovered that nuclear KGDH catal',-— 
oxidative decarboxylation of a-KG to reduce 
the local availability of a-KG to the JMJs coupled 
to KGDH. Reduced a-KG resulted in inhibition of 
histone demethylation by JMJs. We further 
engineered a KGDH subunit to enhance its 
nuclear targeting, which led to a further in- 
hibition of histone demethylation by JMJs. 
Moreover, we found that nuclear KGDH-JMJ 
assemblies can activate or repress the expression 
of target loci, depending on the readout of local 
histone methylation marks. 

Through protein-protein interaction experi- 
ments, we found that the evolutionarily con- 
served carboxyl-terminal regions of a KGDH 
subunit from both human and mouse inter- 
acted with mammalian JMJs. Thus, KGDH 
association with JMJs is conserved in mam- 
malian cells and potentially in other eukary- 
otic organisms. 


CONCLUSION: Here, we have uncovered that 
the TCA cycle-associated enzyme KGDH enters 
the nucleus, and that this process is regulated 
by light in plants. Nuclear KGDH interacts with 
various JMJs, catalyzes oxidative decarbo- 
xylation of o-KG, and thus inhibits o-KG- ‘ 
dependent histone demethylation by JMJs to 
regulate genome-wide gene expression in 
plants. The association of nuclear KGDH with 
JMJs is also found in mammalian cells, sug- * 
gesting that KGDH may function together 
with JMJs to control histone demethylations 
and gene expression in mammals in addi- 
tion to plants. 


The list of author affiliations is available in the full article online. 
*Corresponding author. Email: yhhe@pku.edu.cn 

{These authors contributed equally to this work. 

Cite this article as F. Huang et al., Science 381, eadf8822 
(2023). DOI: 10.1126/science.adf8822 ‘ 


S READ THE FULL ARTICLE AT 
https://doi.org/10.1126/science.adf8822 


ofe H3K4me3 fe H3K27me3 fe H3K36me3 


KGDH 


> O(: 
Night os 


U Mitochondria 


lof1 


RESEARCH 


RESEARCH ARTICLE 


PLANT SCIENCE 


Control of histone demethylation by nuclear-localized 
a-ketoglutarate dehydrogenase 


Fei Huang’+, Xiao Luo**+, Yang Ou}, Zhaoxu Gao}, Qiming Tang*, Zhenzhen Chu?, 


Xinguang Zhu‘, Yuehui He??'>* 


Methylations on nucleosomal histones play fundamental roles in regulating eukaryotic transcription. 
Jumonji C domain-containing histone demethylases (JMJs) dynamically control the level of histone 
methylations. However, how JMJ activity is generally regulated is unknown. We found that the 
tricarboxylic acid cycle—associated enzyme o-ketoglutarate (a-KG) dehydrogenase (KGDH) entered the 
nucleus, where it interacted with various JMJs to regulate o-KG—dependent histone demethylations by 
JMJs, and thus controlled genome-wide gene expression in plants. We show that nuclear targeting is 
regulated by environmental signals and that KGDH is enriched at thousands of loci in Arabidopsis 
thaliana. Chromatin-bound KGDH catalyzes o-KG decarboxylation and thus may limit its local availability 
to KGDH-coupled JMJs, inhibiting histone demethylation. Thus, our results uncover a regulatory 


mechanism for histone demethylations by JMJs. 


uclear DNA wraps around histone pro- 
teins to form nucleosomes that consti- 
tute chromatin. Methylations on histones 
regulate chromatin structure, transcription, 
and other chromatin-based processes 
in eukaryotes (J, 2). Commonly methylated 
sites on histones include lysine 4 (K4), K9, 
K27, K36, and K79 on histone 3 (H3) and K20 
on histone 4 (H4) (2). Histone lysine methyl- 
transferases (KMTs) catalyze the methylation 
of lysine residues on histones, whereas histone 
lysine demethylases (KDMs) remove the methyl 
groups. Histone methylation is dynamically 
altered by KMTs and KDMs in response to 
endogenous and external inputs (/, 3). 
KDMs are composed of two evolutionarily 
conserved families, lysine-specific demethy- 
lases (LSDs), in the KDM1 subfamily, and 
Jumonji C (JmjC) domain-containing histone 
demethylases (JMJs, JHDMs, or JMJDs), a 
large family composed of a few subfamilies. 
Seven subfamilies (KDM2 to KDM7) have been 
found in mammals (J, 4). KDMIs catalyze 
flavin-dependent oxidative demethylation of 
mono- and dimethyl lysine residues including 
H3K4 and H3K9 (J). JMJs are o-ketoglutarate 
(a-KG)/Fe(1)-dependent dioxygenases, and 
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the catalytic JmjC domains use the cosubstrates 
a-KG and oxygen to remove the methyl groups 
from the lysine residues by oxidation (4, 5). JMJs 
are found in most eukaryotes, including animals, 
plants, and fungi, and play a general role in 
regulating gene expression (3, 6). In Arabidopsis 
thaliana, there are 21 potential JmjC domain- 
containing histone demethylases (7, 8). Among 
them, the KDM5 subfamily members, includ- 
ing JMJ14 to JMJ18, function as H3K4 deme- 
thylases that preferentially demethylate trimethyl 
H3K4. By contrast, mono- and dimethyl H3K4 
are typically demethylated by the KDM1 family 
members, including three Arabidopsis homo- 
logs of the human lysine-specific demethylase 
1 (LSD1) (9-11). Six members of the KDM3 
subfamily, including JMJ25/IBM1 (INCREASE 
IN BONSAI METHYLATION 1) and JMJ26 to 
JMJ29, function as H3K9 demethylases (11, 12). 
In addition, homologs of the mammalian KDM4 
family (H3K9/H3K36 demethylases) have been 
found in Arabidopsis, including JMJ11/ELF6 
(EARLY FLOWERING 6), JMJ12/REF6 (REL- 
ATIVE OF EARLY FLOWERING 6), and JMJ13, 
but these JMJs demethylate H3K27 with a sub- 
strate preference for trimethyl K27 (13-15). 
Members of the mammalian KDM2 (H3K36 
demethylases) and KDM6 (H3K27 demethylases) 
subfamilies have not been found in land plants 
(6). Several JmjC domain enzymes or a-KG/Fe 
(D)-dependent dioxygenases may mediate H3K36 
demethylation in Arabidopsis (16, 17), but their 
activities have yet to be validated. 
JMJ-mediated histone demethylation requires 
a-ketoglutarate, an obligatory substrate gener- 
ated primarily by the tricarboxylic acid (TCA) 
cycle in the mitochondria matrix (5, 78). a-KG 
is produced by isocitrate dehydrogenase (IDH) 
through oxidative decarboxylation of isocitrate, 
and is subsequently decarboxylated oxidatively 
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by a-ketoglutarate dehydrogenase (a-KGDH) 
to generate succinyl-CoA and CO, (19, 20). 
These two reactions are rate-limiting steps in 
the TCA cycle (19, 20). The KGDH enzyme com- 
plex is composed of three subunits including 
oxoglutarate dehydrogenase (OGDH, E1 sub- 
unit), dihydrolipoamide succinyltransferase 
(DLST, E2 subunit), and dihydrolipoamide 
dehydrogenase (DLD, E3 subunit) (27, 22). OGDH, 
F2, and E3 catalyze o-KG decarboxylation, the 
formation of succinyl-CoA, and dihydrolipoamide 
dehydrogenation, respectively (22). In Arabi- 
dopsis, the E1 subunit OGDH is encoded by 
two genes, OGDH1 and OGDH2 (22). Putative 
OGDH loss-of-function mutants (loss of both 
OGDHI and OGDH2 function) have been reported 
in which levels of total cellular o-KG remained 
unchanged compared with wild type (WT) (22). 
As a result, the role of KGDH in the control of 
the level of total cellular o-KG is still unclear. 
In addition, these putative double mutants ex- _ 
hibited normal growth and development with 
only slight phenotypic changes (22). Biological 
functions for KGDHs other than their role in 
the TCA cycle are not clear. 

Cytosolic a-KG molecules diffuse into the 
nucleus through nuclear pores and serve as 
an obligatory substrate for histone demethylation 
by JMJs (23, 24). a-KG is a rate-limiting substrate 
for the family of o-KG/Fe(ID)-dependent dioxy- 
genase (19, 25, 26). Michaelis constant (Kn) 
values for o-KG of the JMJs examined so far 
including human KDM4, KDM5, and KDM6 
family enzymes are in the range of 6 to 50 uM, 
and the local physiological concentrations of 
a-KG for JMJs are estimated in this range 
(27-29). Therefore, JMJ activities may be sen- 
sitive to fluctuations in nuclear a-KG concen- 
trations. In mouse embryonic stem cells, a 
reduction in the level of intracellular o-KG 
reduces the rate of histone demethylations by 
JMJs (25). The local availability or concentra- 
tion of a-KG may regulate the activity of nuclear 
JMJs. In this study, using the model plant 
Arabidopsis, we found that the TCA cycle- 
associated enzyme KGDH controls the nuclear 
availability of a-KG to regulate histone deme- 
thylations by JMJs. 


KGDH controls a-KG levels and regulates 
growth and development in Arabidopsis 


To explore the metabolic and biological func- 
tions of KGDH, we constructed loss-of-function 
mutants of the El/OGDH subunits of KGDH. 
Two putative ogdhi ogdh2 double mutants with 
a normal phenotype were previously reported 
(22). However, we observed that in one described 
ogdh2 allele (Salk_055824), the putative muta- 
gen, a transferred DNA (T-DNA), is inserted 
downstream of OGDH2 (fig. S1, A, D, and E), 
whereas another described ogdh2 allele (Salk_ 
122458) carries multiple segregating T-DNAs 
(https://Arabidopsis.info/). We first identified 
two loss-of-function ogdhI mutants in which 
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the full-length transcripts of OGDHI were 
eliminated by insertional T-DNAs (fig. S1, A to 
C). Next, we generated two ogdh2 mutants 
using CRISPR/CRISPR-associated protein 9 
(Cas9). A 9-bp deletion in ogdh2-1 removed 
three amino acid residues from the OGDH2 
protein, and a single-base pair insertion frame- 
shifted ogdh2-2 (fig. S1D). 

We found that the single mutants of ogdhI 
and ogdh2 flowered moderately later than WT, 
based on the total number of primary leaves 
formed before flowering (Fig. 1C and fig. S1F). 
We further found that the double mutants of 
ogdh1 ogdh2 flowered very late even under in- 
ductive long-day growth conditions (Fig. 1, A 
and C, and fig S1G), and plastochron growth in 
the ogdh1 ogdh2 mutants were much slower 
than in the WT (fig. SIH). These factors gave 
rise to greatly prolonged “days to flowering” 
(fig. S1G). The late-flowering phenotype of 
ogdh1 ogdh2 was rescued by the introduction 
of WT genomic OGDH1 or OGDH2 fragments 
(fig. S2A), suggesting functional redundancy 
in promoting floral transition. 

To further determine whether OGDH func- 
tions as part of KGDH to control floral transi- 
tion, we examined loss-of-function mutants of 
E2 subunits. The Arabidopsis E2 subunit is 
encoded by two genes (30), E2a (At5G55070) 
and £2b (At4G26910), with very similar protein 
sequences (figs. S3A and S4, A and C). We 
identified loss-of-function mutants in each 
gene, e2a-1 and e2b-1 (fig. S3B). Both e2a and 
e2b mutants, like ogdhI and ogdh2, flowered 
later than WT (Fig. 1E and fig. S3D). An e2a allele 
with a T-DNA insertion in its 3’ untranslated 
region was recently found to flower slightly 
later (30). We failed to obtain an e2a e2b double 
mutant through genetic crossing, likely due to 
an essential function of E2/KGDH in cellular 
respiration. 

In addition to the developmental abnormali- 
ties, we further observed that the regions en- 
compassing the shoot apical meristem in ogdh1 
ogdh2 turned light purple (Fig. 1B), indicating 
that there was an accumulation of anthocya- 
nins. Anthocyanins function as potent antiox- 
idants to scavenge reactive oxygen species and 
other free radicals in vegetative tissues during 
biotic and abiotic stresses (31, 32). We measured 
the levels of total anthocyanins in the seedlings 
of WT, ogdhi, ogdh2, and ogdhi ogdh2, and 
found that in the ogdh1 ogdh2 double mutants, 
anthocyanins accumulated to higher levels 
than in the WT or single mutants (Fig. 1D and 
fig. S2C), revealing that OGDH1 and OGDH2 
function redundantly to inhibit anthocyanin 
biosynthesis under normal growth conditions. 
Moreover, we found that the levels of antho- 
cyanins were higher in both e2a and e2b mu- 
tants relative to WT (Fig. 1F). Thus, OGDH and 
2 genes act to prevent biosynthesis of antho- 
cyanins in Arabidopsis seedlings grown under 
normal conditions. 
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Next, to determine the extent of perturba- 
tion in cellular primary metabolism upon loss 
of OGDH function, we first measured total cellu- 
lar a-KG levels in whole seedlings of WT and 
ogdhi-2 ogdh2-1 (hereafter referred to as ogdhi/ 
2), and found increased a-KG levels (Fig. 1G). 
This shows that OGDH/KGDH plays a primary 
role in controlling the level of cellular a-KG by 
catalyzing its oxidative decarboxylation. Fur- 
thermore, we measured the cellular levels of 
TCA cycle intermediates and found that succi- 
nate (the product of a-KG decarboxylation) 
and malate were greatly decreased in ogdh1/2 
seedlings compared with WT (fig. S5A), suggest- 
ing a reduction in the TCA cycle flux upon a 
functional loss of KGDH. We also found that 
cellular levels of most Calvin cycle and gly- 
colysis intermediates were increased in ogdh1/ 
2 compared with WT. For example, there was 
an ~3-fold increase in sucrose (fig. S5A). This 
indicates that there may be a slowdown of 
glycolysis caused by a reduction in TCA cycle 
flux upon a functional loss of the rate-limiting 
OGDH/KGDH, which is consistent with the 
slowdown of shoot growth and development 
(slower rates of leaf initiation) in the ogdh 
mutants. 

We found that succinate, fumarate, and malate 
were all decreased in e2a seedlings compared 
with WT (fig. S5B), indicating a reduction in the 
TCA cycle flux. Unlike in ogdhI/2, the level of 
total cellular o-KG in e2a was similar to that in 
WT (Fig. 1H). In KGDH-catalyzed oxidative 
decarboxylation of o-KG, the E1/OGDH sub- 
unit catalyzes decarboxylation of o-KG to pro- 
duce succinyl-thiamine pyrophosphate that is 
later converted into succinyl-CoA by the E2 
subunit (33). OGDH can catalyze a-KG de- 
carboxylation in the absence of E2 (33); there- 
fore, this subunit remains active in the e2a 
seedling, accounting for the normal turnover 
of o-KG in e2a. These results together show that 
a disruption of KGDH function causes a re- 
duction in the TCA cycle flux and a slowdown 
of cellular respiration in Arabidopsis seedlings. 


KGDH enters into the Arabidopsis nucleus 


Because o-KG is an obligatory substrate for 
histone demethylation by JMJs (5), elevation 
of total cellular o-KG in ogdh1/2 might give rise 
to misregulation of various genes and conse- 
quent growth and development abnormali- 
ties. However, this is unlikely, because the e2 
mutants exhibit similar phenotypes as ogdh1/2 
but have a normal level of total cellular a-KG 
as that in WT. This rules out the possibility of 
the level of total cellular o-KG in ogdhI/2 ex- 
erts a critical role to regulate gene expression. 
Alternatively, we reasoned that a minor por- 
tion of the TCA cycle-associated KGDH en- 
zymes may enter into the nucleus to directly 
regulate gene expression. 

It has been reported that a minor portion of 
a subset of mitochondria TCA cycle enzymes 
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can move to the nuclei of several types of ma- 
mmalian cells (27, 34), but it is unknown whether 
this occurs in other eukaryotic cells. Although 
the enzymatic activity-associated domains in 
OGDH and E2 are evolutionarily conserved, 
the rest of the protein sequences are not well 
conserved between Arabidopsis and mammals 
(figs. SIA and S4, A and C). Therefore, we in- 
vestigated whether KGDH may enter the nu- 
cleus in Arabidopsis. Using transgenic seedlings 
expressing functional OGDH1:GFP, OGDH2: 
GFP or E2a:GFP (translational fusions driven 
by respective native promoters), we found that 
most of the OGDH and E2 proteins localized 
to mitochondria (labeled by Mitotracker red 
staining), but a minor portion of OGDH1, 
OGDH2, and E2a localized to nuclei (Fig. 2, A 
to D, and fig. S6, A and C). In addition, we 
found that a minor portion of E2b and E3a 
(AT3G17240; tagged by HA) also localized to 
nuclei (Fig. 2, E and F). We further found | 
OGDH- and E2-dependent KGDH activity in 
the nucleus (fig. S6D). Thus, a minor portion 
of the KGDH holoenzymes are located in the 
nuclei of Arabidopsis seedlings. 

To determine how the KGDH holoenzyme 
enters the nucleus, we examined the sequences 
of OGDH, E2, and E3 subunits using NLStra- 
damus (35), and found that there was a nuclear 
localization signal (NLS) in the E2 subunits, but 
not in the OGDH1, OGDH2, or E3 subunits (fig. 
S4A), indicating that KGDH nuclear targeting 
may depend on E2. In addition, we observed 
that E2s from other eukaryotic organisms 
such as green algae, moss, yeast, and fly also 
bear a predicted NLS (fig. S4C). We mutated 
R236, R237, R242, and R244 in the putative 
NLS (fig. S4A) and found that these mutations 
(in mtE2a-1) prevented nuclear accumulation 
of E2a, but not its localization in mitochondria 
(Fig. 2B and fig. S6, A and B). Similarly, simul- 
taneous mutations of K221, $223, K232, and 
R234 in the NLS of E2a (mtE2a-2) prevented 
its nuclear localization (fig. S6A). Furthermore, 
we found that in the e2a mutant, the nuclear 
abundance of OGDH1 was greatly reduced 
(Fig. 21), which was also reduced in e2b (fig. S6E). 

We further found that the mitochondrion- 
localized E2a:GFP proteins with a mutated 
NLS were not able to complement the e2a 
mutant, whereas the E2a:GFP proteins local- 
ized in both nucleus and mitochondrion 
fully rescued the abnormalities of growth and 
development in e2a (fig. S3, E to H). Thus, 
nuclear KGDH is essential for its biological 
functions. 


Light promotes nuclear KGDH targeting 


Because light is the single most important 
environmental factor for plant growth and 
development (36), we explored the light regu- 
lation of KGDH moving into the nucleus in 
early Arabidopsis seedlings. We found that light 
exposure for 6 hours induced an accumulation 
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Fig. 1. Phenotypes in Arabidopsis KGDH-related mutants. (A) ogdhl ogdh2 
double mutants grown in inductive LDs flowered extremely late. Scale bars, 1 cm. 
(B) ogdhl ogdh2 double mutants accumulated anthocyanins in the region 
encompassing the shoot apical meristem (brown-purple tissues indicated by 
arrows). Scale bars, 5 mm. (€) Flowering times of ogdh mutants grown in LDs. 
The total number of primary leaves formed before flowering in 15 plants per line 
was counted. (D) Anthocyanin content in ogdh seedlings. Relative content was 
calculated by (A530 — A657)/g (fresh weight), and three biological replicates 
were performed for each genotype. (E) Flowering times of e2a and e2b mutants 
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grown in LDs. Fifteen plants per line were scored. (F) Anthocyanin content in WT, 
e2a, and e2b seedlings. Three biological replicates were performed for each 
genotype. (G) a-KG levels in WT and ogdhl ogdh2 seedlings. (H) a-KG levels in 
WT and e2a seedlings. Left, diagram illustrating the reactions catalyzed by 
OGDH/E1 and E2 in the KGDH holoenzyme. TPP, thiamine pyrophosphate. In 
(C) to (H), values are means + SD [(C) and (E)] or SD of three biological 
replicates [(D) and (F) to (H)]. Letters in (C) to (F) mark statistically significant 
differences (one-way ANOVA, P < 0.01). In (G) and (H), ***P < 0.001 and ns, 
significant (two-tailed Student's t test). 
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Fig. 2. Light promotes the KGDH holoenzyme entering the nucleus. 0% of nuclear proteins were loaded for Western blotting. Seedlings grown in LDs 


(A) Subcellular localization of OGDH1:GFP and OGDH2:GFP in leaf epidermal 
cells. Mitochondria are labeled by Mitotracker red staining. Scale bars, 5 um. 
(B) Subcellular localization of E2a:GFP and mtE2a:GFP in leaf epidermal cells. In 
mtE2a-1:GFP, four arginine residues in the putative NLS were mutated to 
alanines. Scale bars, 5 um. (€ to F) Analysis of subcellular localization of KGDH 
subunits by immunoblotting. Cytoplasmic (cyt.) and nuclear (nuc.) proteins 
were extracted separately from the seedlings expressing OGDHI1:GFP (C), E2a: 
GFP (D), E2b:GFP (E), or E3:HA (F). Actin and H3 served as cytoplasmic fraction 
and nuclear fraction markers, respectively. A total of 5% of cytoplasmic and 
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of the OGDH1 proteins in nuclei in young seed- 
lings, and that a 12-hour exposure of light in- 
duced more OGDH1 proteins to be localized in 
nuclei (Fig. 2, G and H). In addition, we ob- 
served elevated nuclear accumulation of the 
other two subunits of KGDH, including E2a 
and Ea, upon light exposure (Fig. 2, J and K). 


Nuclear KGDH functions to inhibit various 
histone demethylations 


The catalytic activities of dioxygenases such as 
JMJs are dependent on the cosubstrate o-KG 
(4). KGDH consumes o-KG and may function 
to inhibit histone demethylations by JMJs. 
Therefore, we investigated whether loss of 
OGDH/KGDH function affects histone methyl- 
ations. We found that H3K4me3, H3K9me?2, 
H3K27me3, and H3K36me3 levels were all 
greatly decreased in the ogdhI/2 seedlings 
relative to WT, whereas in ogdhi and ogdh2 
single mutants, the levels of these marks were 
moderately reduced or similar compared with 
WT (Fig. 3, A and B). In addition, we found 
that loss of OGDH1/2 function had no effect 
on the level of total histones modified with 
H3K4mel, H3K4me2, or H3K27mez2 (Fig. 3, 
A and B). These results suggest that OGDH1 
and OGDH2 redundantly inhibit global his- 
tone demethylations of H3K4me3, H3K9me2, 
H3K27me3, and H3K36me3. 

H3K4me3, H3K9me2, and H3K27me3 are 
known to be demethylated only by JMJs in Ara- 
bidopsis, whereas demethylation of H3K4mel1 
and H3K4me?2 is catalyzed by the Arabidopsis 
homologs of the human LSD1 H3K4 demethylase 
that does not use a-KG as a substrate (9-11, 13). 
Specifically, the Arabidopsis H3K27 demethyl- 
ases, including ELF6, REF6, and JMJ13, prefer- 
entially demethylate H3K27me3; KDM5 family 
members such as JMJ14 preferentially deme- 
thylate H3K4me3; and KDM3 subfamily members 
such as IBM1 demethylate H3K9me2 (9, 17, 13, 15). 
In addition, H3K36me3 seems to be demethylated 
by JMJs (6). The decreases of H3K4me3, 
H3K9me2, H3K27me3, and H3K36me3, but 
not H3K4mel, H3K4me2, or H3K27me2, on 
total histones upon loss of OGDH/KGDH func- 
tion suggest that KGDH functions to inhibit 
a-KG-dependent histone demethylations cata- 
lyzed by JMJs. 

To assess genome-wide changes of H3K27me3 
upon loss of OGDH function, we performed 
chromatin immunoprecipitations with anti- 
H3K27me3, followed by DNA sequencing 
(ChIP-seq), using WT and ogdh1/2 seedlings. 
Consistent with previous findings (37, 38), 
H3K27me3 occurred mainly in gene bodies 
in the Arabidopsis genome (Fig. 3C and fig. 
S7A). Furthermore, we observed that loss of 
OGDHI/2 function caused an overall reduc- 
tion of H3K27me3 in gene bodies (Fig. 3C and 
fig. S7B), corroborating the strong reduction of 
H3K27me3 on total histones in ogdh1/2 rela- 
tive to WT (Fig. 3B). 
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We further examined the levels of histone 
methylations on total histones extracted from 
the e2a seedlings and found that, like in ogdh1/2, 
the levels of H3K4me3, H3K9me2, H3K27me3, 
and H3K36me3 were strongly decreased upon 
loss of £2a function (Fig. 3D). Similarly, the 
levels of these histone marks were decreased 
in the e20 seedlings (fig. S6F). These results 
further show that KGDH acts to inhibit his- 
tone demethylations by JMJs. Because the level 
of total cellular o-KG remains unchanged in 
e2a relative to WT (Fig. 1H), the reduced levels 
of methylated H3K4, H3K9, H3K27, and H3K36 
in e2a are obviously not caused by a fluctuation 
in the level of total cellular o-KG. Given the 
E2-dependent nuclear targeting of KGDH and 
the biochemical role of KGDH in the oxidative 
decarboxylation of a-KG, we reasoned that an 
increase in the level of nuclear o-KG in e2a 
(and e2b and ogdhI/2) may have caused the 
reductions in methylated histones through 
elevated availability of local o-KG to JMJs. 
Indeed, we found that the nuclear E2a, but not 
mitochondrial E2a, was responsible for the 
inhibition of JMJ-mediated histone demethyl- 
ations (Fig. 3D). Together, these results sug- 
gest that nuclear KGDH catalyzes oxidative 
decarboxylation of o-KG to reduce its local 
concentration and thus inhibit JMJ activities. 


Nuclear KGDH regulates environment- 
responsive gene expression 


To investigate the role of OGDH1/2 and E2a in 
transcriptome regulation, we performed RNA 
sequencing (RNA-seq) and found that 1008 and 
587 genes were up-regulated, whereas 1572 and 
791 genes were down-regulated in ogdhI/2 
and e2a relative to WT, respectively [cutoff value: 
1.8-fold; false discovery rate (FDR) < 0.05)] (Fig. 
3, E and F; fig. S8, A to C; and table S1). A total 
of 78% of the up-regulated genes (422 of 544) 
and 89% of the down-regulated genes in e2a 
(702 of 791) were also up- or down-regulated in 
ogdhi/2, respectively (Fig. 3, E and F, and fig. 
S8D), consistent with the idea that OGDH (E1) 
and E2 function as part of the KGDH holo- 
enzyme to regulate nuclear gene expression. 
We examined the chromatin features of 
KGDH-regulated loci by comparing them with 
H3K27me3- or H3K4me3-modified loci in 
Arabidopsis seedlings, because H3K4me3 and 
H3K27me3 are linked with transcriptional ac- 
tivation and repression, respectively (3, 39). A 
total of 257 up-regulated genes in the ogdh1/2 
seedlings were H3K27me3-bearing loci, and 
635 down-regulated genes in ogdhI/2 were 
H3K4me3-bearing loci (Fig. 3, E and F, and 
table S2), revealing significant overlap. Further- 
more, we found that in most of the 257 up- 
regulated genes in the ogdh1/2 seedlings, the 
levels of repressive H3K27me3 were reduced 
compared with WT, particularly in the gene-body 
regions where H3K27me3 is enriched (Fig. 3H), 
suggesting that the nuclear OGDH/KGDH inhib- 
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its JMJ-mediated demethylation of H3K27me3 
to repress the expression of these genes. 

To understand the biological functions of 
OGDH-regulated genes, Gene Ontology (GO) 
enrichment analysis was conducted. Most of 
top over-represented or enriched GO terms 
of biological process were related to responses 
to environmental stimuli (biotic and abiotic 
stimuli) and stress hormones including absci- 
sic acid and jasmonic acid (fig. SSE). The GO 
term “response to red and far red light” was 
enriched in the OGDH/KGDH regulon (fig. 
S8E), consistent with light promoting KGDH 
movement into the nucleus (Fig. 2, Gand H). 
Furthermore, the expression of genes encoding 
biosynthesis of anthocyanins (stress marker 
metabolites) was enriched in the GO analysis 
(fig. S8E), consistent with anthocyanin accumu- 
lation upon functional loss of KGDH in the 
ogdhi/2 and e2 mutants (Fig. 1, D and F). 


Next, we investigated whether the regula- | 


tory genes in the floral transition or anthocya- 
nin biosynthesis may be misregulated upon 
functional loss of KGDH. The cold-responsive 
MADS-box transcription factor FLOWERING 
LOCUS C (FLC) is a central floral repressor (40). 
We found that FLC expression was derepressed 
in both ogdhI1/2 and e2a (with a >4-fold in- 
crease in ogdhi/2) (Fig. 3, G and I). The late- 
flowering phenotype in ogdh1/2 was largely 
rescued by an flc mutation (fig. S2D). Further- 
more, we found that the expression of the 
master regulator of anthocyanin biosynthesis, 
PRODUCTION OF ANTHOCYANIN PIGMENT 
1 (PAPI) (31, 32), was greatly derepressed in 
both ogdhI/2 and e2a (~20-fold increase in 
ogdhi/2) (Fig. 3, G and I). The stress-responsive 
MYB-domain transcription factor PAP1 acti- 
vates the expression of structural genes in 
anthocyanin biosynthesis (37, 32). We found 
that a papI mutation largely eliminated the 
anthocyanin accumulation caused by loss of 
OGDHT1/2 function (Fig. 3J and fig. S8F), re- 
vealing that OGDH1 and OGDH2 repress PAPI 
expression to inhibit anthocyanin biosynthesis. 

In an e2a line expressing only mitochondria- 
localized E2a, we found that the derepression 
of neither FLC nor PAPI in e2a was rescued 
(Fig. 31). This confirms that the derepression 
of both genes is caused by the loss of nuclear 
E2/KGDH activity. We thus conclude that the 
nuclear KGDH, but not the mitochondria 
KGDH alone, is essential for transcriptome 
regulation. 


OGDH interacts with JMJs in the nucleus 


We found that nuclear KGDH functions to 
inhibit histone demethylations by JMJs. To 
address how nuclear KDGH may inhibit JMJ 
activities, we first investigated whether OGDH/ 
KGDH may interact with individual JMJs. 
Using yeast two-hybrid analysis, we found that 
the C-terminal regions of OGDH1 and OGDH2 
(composed of a conserved pyrimidine-binding 
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Fig. 3. Nuclear KGDH A 
functions to inhibit histone 
demethylations and regulate Hskames 
global gene expression. (A and 
B) Levels of H3 methylated at H3K4me2 
various lysine residues in the 

H3K4me1 


indicated Arabidopsis seedlings 
as revealed by immunoblotting. 
Total histones extracted from the 
seedlings were analyzed. Shown 
in (A) is a set of representative 
blots from three biological 
epeats. Relative levels of H3 H3 
methylated at K4, K9, K27, or 

K36 are shown in (B). Values are 
means + SD of three biological 
eplicates. (C) Metagene plots of Cc 
H3K27me3 enrichment in WT 


H3K36me3 


OWT 


° 


° 


5 


9° 


Relative fold change 


5 


D 
1 


and ogdh1/2 seedlings. RPGC, 
eads per genomic content; TTS, 
transcription termination site. 
(D) Levels of H3 methylated at 
K4, K9, K27, or K36 in e2a- 
related seedlings, as revealed by 
immunoblotting. Relative levels 
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are shown, and values are 
means + SD of five biological 
replicates. (E and F) Venn 
diagrams of the genes up- E 
regulated (E) or down-regulated 
(F) in ogdhl/2 and/or e2a 
seedlings overlapped with 
H3K27me3- or H3K4me3- 
occupied loci. H3K4me3- 
modified genes (at a seedling 
stage) are extracted from 
GSE185121. Overlapping P values 
(Fisher's exact test) are indi- 
cated. (G) Validation of RNA-seq 
data by qPCR. Plotted are fold 
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seedlings. Transcript levels were first normalized to the constitutively expressed TUBULIN2 (TUB2), and relative expression to WT is shown. Values are means + SD 
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domain and a conserved C-terminal domain) 
interacted with the JmjN domain-bearing 
N termini of the H3K27 demethylase family 
members, including ELF6, REF6, and JMJ13 
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(Fig. 4A and fig. S9A). Next, we conducted 
coimmunoprecipitations (co-IPs) using F; seed- 
lings expressing functional OGDH1:HA and 
ELF6:Flag (47), and found that OGDH1:HA 
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was immunoprecipitated by ELF6:Flag (Fig. 
4B). Moreover, using epitope-tagged control 
proteins, we did not find any association of 
Flag with HA, ELF6 with HA, or OGDH1 
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Fig. 4. OGDH subunits interact with A Dehydrogenase = a = — aD 
JMJs and function to inhibit JMJ- OGDH1 PBD aC AD:OGDH1-C AD:OGDH1-C ; 
mediated histone demethylations. oGbH1-c <-> ct © © | BD eo ow 
(A) OGDHI and OGDH2, through pore seeiesng erie O AD:OGDH2-C eo ole 
C terminus regions (OGDH-C), inter- ELF6 AD:OGDH1-C An 
acted with N terminus JmjN-bearing ELFe-N @ BD:REF6-N BD:REF6-N eo 
regions of ELF6, REF6, and JMJ14 in ne sl en © © _ AD eo 
yeast cells. Left: schematic illustra- eer ee an eee 
tions of proteins. N, conserved SEN FYRN FYRC Soin © SOU fo om 
N-terminal motif in OGDH1; PBD, JmJ14 _ ud 

pyrimidine-binding domain; C, JmJ14-N ian BD:JMJ14-N 
C-terminal domain. OGDHI-C (amino 
acids 601 to 1017) and OGDH2-C Input IP Cc Input IP D Input IF 
(amino acids 601 to 1025) were fused OGDH1:HA + + + + OGDH1:Flag + + os OGDAIE A: he a. a oF 
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acids 1 to 60), REF6-N (amino acids Anti-HA Anti-Flag a Anti-HA a 
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(J) and (K) are means + SD of three biological replicates analyzed for statistical significance by two-tailed Student's t test. **P < 0.01; ns, not significant. 


with Flag (fig. S9, F to H). These results 
confirm that OGDH1 indeed associates with 
the ELF6 H3K27 demethylase in seedlings. 
Furthermore, we performed bimolecular 
fluorescence complementation (BiFC) assays 
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through transient coexpression of OGDHI1- 
nEYFP and ELF6-cEYFP in tobacco epider- 
mal cells, and found that OGDH1 physically 
associated with ELF6 in the nucleus, but not 
in the cytoplasm or mitochondria (Fig. 4E). 
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Similarly, OGDH1 was found by co-IP to be 
in a complex with REF6 (Fig. 4C and fig. 
S9I), using F, seedlings expressing func- 
tional REF6:HA and OGDHI1: Flag (fig. S9, B 
and C). These results together show that the 
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nuclear OGDH/KGDH is in a complex with 
several H3K27 demethylases. 

The N-terminal JmjN domains exist widely 
in most JMJs in both plants and animals, and 
are involved in protein-protein interactions 
(42, 43). We further found that the N-terminal 
JmjN domain-bearing region of the H3K4 de- 
methylase JMJ14 (also known as PKDM’B) inter- 
acted with the C-terminal regions of OGDH1 
and OGDH2 in yeast cells (Fig. 4A). Moreover, 
we conducted co-IPs using the F, seedlings 
expressing functional OGDH1:HA and JMJ14: 
Flag (44), and found that OGDH1 was immuno- 
precipitated by JMJ14:Flag. Thus, OGDH1 is 
indeed in a complex with JMJ14 in seedlings 
(Fig. 4D), consistent with OGDH/KGDH func- 
tioning to inhibit global H3K4me3 demethyl- 
ation. Similarly, OGDH1 was found to be in a 
complex with the H3K9 demethylase IBM1 
using yeast two-hybrid analysis and co-IPs 
with the F, seedlings expressing the functional 
IBM1:Flag (45) and OGDH1:GFP (Fig. 4F and 
fig. S9D), consistent with the function of OGDH/ 
KGDH in inhibiting H3K9me2 demethylation. 


Association between mammalian KGDH 
and JMJs 


Most JMJs are evolutionarily conserved in the 
eukaryotes bearing histones with tails (3, 7, 46). 
In addition, the C-terminal regions of OGDH 
subunits are composed of a pyrimidine-binding 
domain and a C-terminal domain, which are 
highly conserved across eukaryotes (fig. SIA). 
Because a minor portion of KGDHs can be tar- 
geted into the nuclei of several types of mam- 
malian cells (21, 47, 48), we investigated whether 
the association of nuclear OGDH/KGDH with 
JMJs may be conserved in mammals. Using 
yeast two-hybrid assays, we found that C- 
terminal regions of OGDH1 from human 
and mouse indeed interacted with the JmjN- 
bearing N termini of the human and mouse 
H3K9/H3K36 demethylase KDM4<A, respective- 
ly (fig. S9E). Furthermore, we coexpressed the 
human OGDH1:Flag and KDM4A:HA in mam- 
malian human embryonic kidney (HEK) 293F 
cells and found that these two proteins indeed 
associate with each other by co-IPs (Fig. 4G). 
There was no association of the human OGDH1 
with HA or KDM4<A with Flag (fig. S9, J and K); 
therefore, it is possible that the nuclear KDGH 
may interact and function together with JMJs 
to control histone demethylations in mammals 
in addition to plants. 


JMJ-associated KGDH catalyzes oxidative 
decarboxylation of o-KG in the nucleus 


To determine the biochemical function of 
nuclear KGDH, we immunoprecipitated KGDH 
from the nuclei isolated from the transgenic 
Arabidopsis seedlings expressing the functional 
OGDH1:Flag and further examined KGDH 
activity. We found that the immunoprecipitated 
KGDH complex from nuclei had potent oxidative 
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decarboxylation activity (Fig. 4H). Next, we 
investigated whether active JMJs are co-IPed 
with the nuclear KGDH by measuring the 
demethylation activities of nuclear KGDH im- 
munoprecipitations, and found that these preci- 
pitates demethylated H3K4me3 and H3K27me3 
(substrates of JMJs), but not H3K27mel or 
H3K4mel (Fig. 41). In addition, these activi- 
ties were inhibited by succinate (fig. SIOA), a 
known inhibitor of JMJ activity (49). These 
results reveal that active JMJs indeed associate 
with the KGDH holoenzyme in the nucleus. 


Elevated KGDH in the nucleus inhibits histone 
demethylation by JMJs 


To further confirm that nuclear KGDH acts to 
inhibit JMJ activities, we fused a potent nu- 
clear localization signal from Simian Virus 40 
(SV40) (50) at the C terminus of OGDH2 
(OGDH2:NLSgy4o:Flag). The recombinant 
OGDH2:NISsyo:Flag was transformed into 
Arabidopsis. More OGDH2:NLSgyao:Flag pro- 
teins were targeted to the nucleus compared 
with OGDH2:Flag (fig. S10B), which had mini- 
mal effects on the level of total cellular o-KG 
(fig. S1OC). We found that elevated nuclear 
accumulation of OGDH2 in the OGDH2:NISsy40: 
Flag line resulted in increased levels of H3K4me3, 
H3K9me2, H3K27me3, and H3K36me3 (prefer- 
ential substrates of JMJs), but not of H3K4mel1 
or H3K27me2 (Fig. 4J). These results revealed 
that elevated nuclear accumulation OGDH2 
(KGDH) inhibits histone demethylations by 
JMJs, reflecting that the nuclear KGDH holo- 
enzyme, complexed with JMJs, catalyzes oxi- 
dative decarboxylation of a-KG to reduce its 
availability to JMJs. 

Targeting KGDH into the nucleus is pro- 
moted by light, so we examined the levels of 
methylated H3 histones in early seedlings 
exposed to light for 12 hours [Zeitgeber time, 
12 hours after lights on (ZT12)]. Compared 
with predawn, light exposure resulted in appa- 
rent increases, including in H3K4me3, H3K9me2, 
H3K27me3, and H3K36me3, among which the 
elevations of methyl H3K9, H3K27, and H3K36 
were fully dependent on the activity of OGDH/ 
KGDH (Fig. 4K). Thus, light promotes KGDH 
targeting into the nucleus to inhibit histone 
demethylations by JMJs in early seedlings. 

In summary, the results thus far reveal that 
the nuclear KGDH holoenzyme, in complex 
with individual JMJs, catalyzes oxidative 
decarboxylation of o-KG to reduce its local 
availability to JMJs (on target chromatin), re- 
sulting in an inhibition of histone demethyl- 
ations by JMJs in the Arabidopsis genome. 


Nuclear KGDH occupies regions of target loci 
where histone methylations occur 


We explored the genome-wide occupancy of 
OGDH1 by ChIP-seq using OGDH1:Flag seed- 
lings. We found that 3528 loci were occupied by 
OGDHI:Flag across the Arabidopsis genome 
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(Fig. 5, A to C, and fig. S11A). OGDHI1 was en- 
riched in gene-body regions of most target loci; 
in addition, it bound to 5’ transcribed regions 
proximal to the transcription start site (TSS) at 
part of the target loci (Fig. 5, A and B). Histone 
methylations, including H3K9me2, H3K27me3, 
and H3K36me3, are known to preferentially 
occur in gene-body regions in the Arabidopsis 
genome, whereas H3K4 trimethylation occurs 
in 5’ transcribed regions with a peak near the 
TSS (39). Thus, the spatial pattern of genome- 
wide OGDHI occupancy is consistent with its 
function in inhibiting histone demethylations of 
H3K4me3, H3K9me2, H3K27me3, and H3K36me3 
by JMJs. We further found that more than two- 
thirds of the OGDH1-bound loci (2454 of 3528) 
were marked with H3K4me3, and about one- 
fourth of these loci (915) were marked with 
H3K27me3 in Arabidopsis seedlings (Fig. 5C). 
The rest of OGDH1-bound loci may bear other 


lysine methylations such as H3K9me2 or | 


H3K36me3. Together, these results reveal that 
nuclear OGDH/KGDH occupies 5’ transcribed 
and/or gene-body regions of target loci where 
histone methylations occur. 

Next, we examined the changes of H3K27me3 
at the 915 OGDH1-occupied loci (with H3K27me3 
marks) in ogdhI/2 relative to WT, which consist 
of 229 H3K27me3-bearing genes and 686 loci 
with both H3K4me3 and H3K27me3 (Fig. 5C). 
We found that upon loss of OGDH1/2 function, 
most of the 229 H3K27me3-bearing loci occu- 
pied by OGDH1 displayed an overall reduction 
of H3K27me3 in gene-body regions where 
H3K27me3 was enriched, suggesting that 
OGDH/KGDH functions to inhibit H3K27 de- 
methylation at these loci. At the 686 loci with 
H3K27me3 and H3K4me3, about half exhibited 
an H3K27me3 reduction in gene-body regions, 
whereas about one-third displayed a moderate 
increase in H3K27me3 (Fig. 5E). This indicates 
that at the loci with multiple methylation 
marks, OGDH/KGDH-mediated inhibition of 
JMJ-dependent histone demethylations may 
exert a complex effect on local chromatin 
features. 

To further determine the role of OGDHI7 in 
transcriptional regulation of target genes, we 
compared the list of OGDH1-bound loci with 
the list of differentially expressed genes (DEGs) 
with a cutoff value of 1.8 in ogdhI/2 (versus 
WT), and found that 174 of the 1008 up- 
regulated genes (P = 4.8 x 107°, Fisher’s exact 
test) and 263 of the 1572 down-regulated genes 
(P = 5.5 x 10°") were bound by OGDHI (Fig. 
5D and table S4). Moreover, comparing the list 
of OGDH1-bound loci with DEGs with a cutoff 
value of 1.5 (in ogdhI/2 versus WT) revealed 
that a total of 352 up-regulated genes and 402 
down-regulated genes were bound by OGDH1 
(fig. S11B and table S4). These results show 
that the chromatin-bound OGDH1 is required 
to promote or repress the expression of various 
genes in Arabidopsis. 
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Fig. 5. OGDH/KGDH couples with A 
JMJs to regulate histone demethyla- 3.0 
tions and gene expression in the Ara- 3 
bidopsis genome. (A and B) Metagene 2 
plot (A) and heatmap (B) showing “20 
OGDHI1:Flag occupancy at target loci in 15 
the Arabidopsis genome. (€ and D) 

Overlapping of the OGDH1:Flag-bound 1.0 
loci with H3K4me3- and/or H3K27me3- 

modified loci (C) or the genes misregu- 

lated in ogdh1/2 (D). Overlapping P E . 


values (Fisher's exact test) are indicated. 
(E) Heatmaps showing H3K27me3 
changes at the OGDH1-bound loci with 
H3K27me3 (top) or H3K27me3 and 
H3K4me3 (bottom) in ogdh1/2 (versus 
WT). Blue color indicates a reduction of 
H3K27me3 in ogdh1/2 relative to WT (at 
a seedling stage). (F) ChIP-qPCR analy- 
sis of co-occupancy of OGDHI1:Flag and 


OGDH1 loci with H3K27me3 only 
229) 


JMJ14:Flag at ERD5, TTL3, and DRMI. Es : 
Top: genome browser views of OGDH1: = i 
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drawings of gene structures. Arrows z = 
indicate TSS and gray bars indicate 8 


ChIP-examined regions. Total chromatin 
was extracted from the seedings 
expressing the functional OGDHL:Flag 
and JMJ14:Flag. Relative fold enrich- 
ments of OGDH1:Flag or JMJ14:Flag 
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KGDH and JMJs regulate local histone 
demethylations and gene expression 

To explore how nuclear KGDH functions to- 
gether with JMJs at individual target loci to 
regulate their expression, we selected six 
OGDH1-bound loci modified with H3K4me3 
or H3K27me3 for further examination. First, 
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using ChIP coupled with quantitative polymer- 
ase chain reaction (ChIP-qPCR), we validated 
the occupancy of OGDH1:Flag in gene-body 
regions at three environment-responsive loci 
marked by H3K4me3, including AT1G28330 
(DORMANCY-ASSOCIATED PROTEIN 1; DRM1), 
AT2G42580 (TETRATRICOPETIDE-REPEAT 
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THIOREDOXIN-LIKE 3; TTL3), and AT3G30775 
(EARLY RESPONSIVE TO DEHYDRATION 5; 
ERD5) (Fig. 5F and table S3). Given the asso- 
ciation of nuclear KGDH with the H3K4 de- 
methylase JMJ14, we investigated whether 
JMJ14: may be coenriched at the OGDH1-bound 
regions. Indeed, we found that at TTL3 and 
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ERD5, JMJ14 and OGDHI co-occupied 5’ trans- 
cribed regions and the middle of gene bodies, 
whereas at DRM], these proteins co-occupied a 
5’ transcribed region (Fig. 5F). 

We further found that upon loss of OGDH1/2 
function, levels of H3K4me3 in the regions co- 
occupied by OGDH1 and JMJ14 were decreased 
at DRM1, TTL3, and ERDS (fig. SIC), suggest- 
ing that OGDH (KGDH) acts to inhibit JMJ14- 
mediated H3K4 demethylation. In addition, we 
found that upon loss of JI/J14 function, the 
expression of these three genes were all up- 
regulated (Fig. 5G), revealing that JMJ14- 
mediated demethylation of H3K4me3 (an 
active mark) represses their expression. Further- 
more, we found that loss of KGDH function in 
either ogdh1/2 or e2a resulted in repression of 
these three genes (Fig. 5H), revealing that ele- 
vated levels of local a-KG at these loci function 
to repress their expression. This is consistent 
with the reduced levels of H3K4me3 at these 
loci in ogdhi1/2 (fig. S11C). Together, these re- 
sults reveal that KGDH reduces the local avail- 
ability of o-KG to inhibit JMJ14-mediated H3K4 
demethylation at DRM1, TTL3, and ERD5 and 
thus promote their expression. 

Next, we examined the co-regulation of KGDH 
with an H3K27 demethylase at three H3K27me3- 
marked loci, including AT2G34650 (PINOID; 
PID), PHYTOCHROME INTERACTING FACTOR 
4 (PIF4; encoding a red light-responsive trans- 
cription factor; ref 36), and MYB DOMAIN 
PROTEIN 111 (MYBI111). MYBI11 is a Key trans- 
cription factor that activates the expression of 
early structural genes in anthocyanin biosyn- 
thesis and functions together with the master 
regulator, PAPI, to activate anthocyanin bio- 
synthesis in land plants (57). We first validated 
that OGDHI1 was enriched in the gene-body 
regions of PID, PIF4, and MYBI1II (Fig. 51). 
Given the physical association of OGDH1 with 
the H3K27 demethylase ELF6 in the nucleus, 
we further investigated with ChIP whether 
ELF6 may co-occupy with OGDH1 at PID, PIF4, 
and MYBIII using the seedlings expressing a 
functional ELF6:GFP (fig. SITE). We found that 
ELF6 was indeed enriched in the gene-body 
regions where OGDH1I binds to at these loci 
(Fig. 51). ELF6 is typically recruited to chromatin 
through its DNA-binding domains, association 
with a DNA-binding protein, and/or recogni- 
tion of methyl histones (41, 52). We reasoned 
that the KGDH holoenzyme may be recruited 
to PID, PIF4, and MYBII11 through the ELF6 
protein bound to the chromatin regions at these 
loci. Indeed, we found that OGDH1 enrichment 
at these loci was dependent on ELF6 (Fig. 5D, 
revealing that ELF6 recruits OGDH/KGDH or 
stabilizes its association to the gene-body chro- 
matin at these loci. This indicates that the 
enrichment of OGDH/KGDH at target chro- 
matin regions depends on its coupled JMJs. 

ELF6 catalyzes demethylation of H3K27me3, 
a repressive chromatin mark, to promote target 
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gene expression. We found that ELF6 was 
required for the expression of PID, PIF4, and 
MYBII1I (Fig. 5J). We further found that, in 
contrast to loss of ELF6 function, loss of KGDH 
function in either ogdhi/2 or e2a resulted in 
derepression of these three genes in seedlings 
(Fig. 5K), revealing that the elevated availability 
of local a-KG at these loci functions to promote 
their expression. Consistent with these find- 
ings, we found that the levels of H3K27me3 in 
the regions co-occupied by OGDH1 and ELF6 
at PID, PIF4, and MYBIII were decreased upon 
loss of OGDH1/2 function (fig. S11D). Together, 
these results reveal that KGDH in the chromatin- 
bound ELF6-KGDH assembly consumes local 
a-KG to inhibit ELF6-mediated H3K27 deme- 
thylation and thus repress PID, PIF4, and 
MYBI1II expression. 

ELF6 binds to 5’ transcribed and gene-body 
regions at FLC to mediate H3K27 demethyl- 
ation, and thus promote FLC expression and 
inhibit the transition to flowering (47). Genome- 
wide analysis of OGDH1:Flag occupancy showed 
that it was enriched in the 5’ transcribed re- 
gion and gene-body regions at FLC at the 
seedling stage, which was validated by ChIP- 
qPCR (fig. SIF). These results together suggest 
that KGDH, in association with ELF6, reduces 
the level of local o-KG at FLC to inhibit H3K27 
demethylation by ELF6 and thus repress FLC 
expression and promote the floral transition 
in Arabidopsis. 

In summary, chromatin-bound OGDH (KGDH) 
catalyzes a-KG decarboxylation, resulting in a 
reduction in its local availability to the KGDH- 
coupled JMJs and consequent inhibition of 
histone demethylations by JMJs, to regulate 
the expression of various target loci. Because 
different histone methylation marks are often 
read and interpreted by discrete effectors to 
repress or activate gene expression, the chromatin- 
bound KGDH can promote or repress target 
gene expression, depending on its coupled JMJs 
on target chromatin. 


KGDH interacts with ELF6 to repress 
anthocyanin biosynthesis 


To uncover the detailed molecular mechanism 
for KGDH regulation of PAP expression and 
anthocyanin biosynthesis, we first conducted 
ChIP and found that both OGDH1:HA and 
OGDH2:GFP bound to PAPI chromatin in 
gene-body regions in seedlings (sampled at ZT6) 
(Fig. 6, A and B). PAPI chromatin bears the 
repressive H3K27me3 marks under normal 
growth conditions (table $2). We found that 
loss of OGDH1/2 function resulted in an ap- 
parent reduction of H3K27me3 in the gene- 
body regions where both OGDH1 and OGDH2 
bind (Fig. 6C). This suggests that the o-KG 
decarboxylase OGDH functions to inhibit H3K27 
demethylation on PAPI chromatin. 

Light promotes KGDH nuclear targeting. 
We found that a 6-hour light exposure led to 
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increased OGDH1 binding to PAPI chroma- 
tin in young seedlings (ZT6 versus predawn) 
(fig. S12A). Moreover, we found that light ex- 
posure caused a strong increase of H3K27me3 
specifically at the gene-body regions where 
OGDH1 binds, whereas only slight changes 
in H3K27me3 were observed in ogdhI/2 (ZT6 
versus predawn) (fig. S12B). This reveals that 
light-induced elevation in the enrichment of 
OGDHI (and likely OGDH2) causes an inhibi- 
tion of H3K27 demethylation at PAPI, reinforc- 
ing the notion that OGDH/KGDH consumes 
a-KG around target chromatin regions to in- 
hibit H3K27 demethylation by JMJs at PAPI. 

Next, we found that the H3K27 demethylase 
ELF6 (ELF6:GFP) was greatly enriched in the 
gene-body regions where both ODGH1 and 
OGDH2 bind (Fig. 6D). We further found that 
loss of ELF6 function, in contrast to loss of 
OGDHTI/2 function, gave rise to strongly elevated 
levels of H3K27me3 in the gene-body regions 
bound by ELF6 and OGDH1/2 (Fig. 6C). Thus, 
the OGDH1/2-coupled ELF6 catalyzes H3K27me3 
demethylation on PAPI chromatin. The loss of 
ELF6 function led to moderately elevated 
levels of H3K27me3 in the regions upstream 
of PAPI gene body where ELF6 was not stably 
enriched (Fig. 6C), which may be attributed to 
a loss of transient ELF6 association or spread- 
ing of elevated H3K27me3 from the gene body. 

We compared the levels of H3K27me3 at the 
gene-body regions of PAPI in WT, ogdh1/2, 
and elf6 ogdh1 ogdh2 and found that the de- 
crease of H3K27me3 in ogdhI/2 was rescued 
by loss of ELF6 function (Fig. 6C). Levels of 
H3K27me3 in the OGDH1/2-bound regions 
at PAPI were moderately lower in elf6 ogdh1 
ogdh2 than in elf/6, suggesting that there may 
be another H3K27 demethylase functioning 
at PAPI. Indeed, we found that the ELF6 homolog 
JMJ13, a partner of the nuclear KGDH, was 
also enriched in PAPI gene-body regions, like 
ELF6 and OGDH1/2 (fig. S12C). This indicates 
that JMJ13 may partly mediate H3K27 demethyl- 
ation at PAPI, particularly in ogdh1/2 (with 
elevated o-KG). 

We further determined the role of the 
KGDH-ELF6 assembly in PAPI regulation and 
anthocyanin biosynthesis. Consistent with the 
elevated levels of H3K27me3 at PAPI in elf6, 
PAPI expression was further repressed upon loss 
of ELF6 function (Fig. 6E). Moreover, the great 
derepression of PAPI upon loss of OGDH1/2 
function was largely suppressed by loss of 
ELF6 function in the elf6 ogdhI ogdh2 seed- 
lings, and consequently anthocyanin accumu- 
lation was largely eliminated in this triple 
mutant (Fig. 6E and fig. S$12D). These results 
reveal that KGDH-dependent PAPI repression 
mainly occurs through an inhibition of ELF6- 
mediated demethylation of H3K27me3 on 
gene-body chromatin at PAPI. 

Thus far, the results have revealed that in 
response to light exposure, KGDH enters the 
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Fig. 6. KGDH couples with ELF6 to inhibit local H3K27 demethylation by 
ELF6 at PAPI to repress its expression. (A and B) ChIP-qPCR analysis of 
OGDHIL:HA (A) and OGDH2:GFP (B) enrichment on PAPI chromatin. At bottom in 
(A) is a schematic drawing of the PAPI gene structure. Arrow indicates TSS and 
gray bars indicate ChIP-examined regions. Relative fold enrichments of OGDH1: 
HA or OGDH2:GFP over a background control (WT) at examined regions are 
presented. (C) ChIP analysis of H3K27me3 status on PAPI chromatin in the 
indicated seedlings. Levels of genomic fragments of each examined region 
(immunoprecipitated by anti-H3K27me3) were quantified and normalized to 
“input DNA.” (D) ChIP analysis of ELF6:GFP enrichment on PAPI chromatin. 
Shown are relative fold enrichments of ELF6:GFP over a background control. 
(E) Transcript levels of PAPI in WT, elf6, ogdh1/2, and elf6 ogdh1/2 seedlings. 
The transcripts were quantified by qPCR and normalized to TUB2. For (A) to (E), 
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samples were harvested at ZT6, and values are means + SD of three biological 
replicates. Letters in (E) mark statistically significant differences (one-way 
ANOVA, P < 0.01). (F) Working model for the dynamic control of histone 
demethylations by nuclear KGDH. Environmental signals such as light promote a 
minor portion of cytoplasmic KGDH entering the nucleus. KGDH forms an 
assembly with chromatin-bound JMJs, catalyzes a-KG decarboxylation, and thus 
limits its local availability to the JMJs coupled with KGDH, resulting in an 
inhibition of histone demethylations in target chromatin regions to regulate gene 
expression. The conversion of NAD* to NADH in a-KG decarboxylation may 
reduce the local NAD* pool, which likely has an effect on NAD*-dependent 
histone modifications. In addition, succinyl-CoA is likely to be used as a substrate 
for histone succinylation. TCA cycle intermediates are indicated as follows: Cit, 
citrate; OAA, oxaloacetate; SUC, succinate; and SUC-CoA, succinyl-CoA. 


nucleus and interacts with ELF6. The KGDH- 
ELF6 assembly is enriched at the gene-body 
chromatin of PAPI, where KGDH catalyzes 
a-KG decarboxylation. Reduction of local o-KG 
availability inhibits H3K27 demethylation by 
ELF6 on the gene-body chromatin at PAPI. 
This results in PAPI repression to prevent 
anthocyanin biosynthesis in Arabidopsis seed- 
lings grown under proper conditions. Collect- 
ively, these findings illustrate a molecular 
mechanism underlying PAPI regulation by 
KGDH-ELF6, which exemplifies the KGDH-JMJ 
assembly-mediated regulation of histone demeth- 
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ylations and gene expression at target loci in 
the Arabidopsis genome (Fig. 6F). 


Discussion 


In this study, we have found that in Arabidopsis 
KGDH is targeted into the nucleus through 
the NLS in E2. Eukaryotic E2 subunits contain 
two highly conserved domains, a lipoyl attach- 
ment domain (LAD; near N terminal) and a 
lipoamide acyltransferase domain (LAT; C 
terminal) (53). The nuclear localization sequences 
located between LAD and LAT in the Arabdopsis 


E2a and E2b are not conserved in E2s from 


14 July 2023 


other eukaryotic organinsms such as green algae, 
moss, yeast, or fly, but these eukaryotic E2s 
bear a predicted NLS. This indicates that a small 
portion of the TCA cycle-associated holoenzyme 
KGDHs may be targeted into the nucleus in 
other eukaryotic organisms in addition to 
Arabidopsis. Indeed, it has been reported that 
a minor fraction of KGDH is localized in the 
nuclei of several types of mammalian cells 
(21, 47, 48). Given that the highly conserved 
C-terminal regions of OGDH subunits interact 
with JmjN-bearing N-terminal regions from 
JMJs in Arabidopsis, mouse, and human, it is 
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very likely that the association of nuclear KGDH 
with JMJs to control histone demethylation 
activities is evolutionarily conserved in eukar- 
yotic organisms. 

Nuclear KGDH targeting is regulated by 
environmental signals, and KGDH functions 
together with JMJs to regulate the expression 
of various environment-responsive genes in 
Arabidopsis. The H3K27 demethylases REF6 
and JMJ13 regulate various environment- 
responsive genes in Arabidopsis, and the genes 
involved in responses to various stimuli are highly 
enriched in the REF6-bound loci (15, 52, 54). In 
addition, a large number of IBM1-regulated 
protein-coding genes are stress or environ- 
ment responsive (77). Moreover, an analysis of 
cis-acting DNA elements located in the pro- 
moter regions of JMJs in diverse plant species 
revealed that biotic and abiotic stress-related 
elements are rich in these regions, suggesting 
that the expression of certain JMJs is regu- 
lated by environmental factors (46). These 
findings together reveal that JMJ-dependent 
control of the dynamics of histone methyla- 
tions at environment-responsive loci may play 
a critical role in transcriptome reprogramming 
during environmental responses in plants. 

In several human cancer cell lines, nuclear 
KGDH interacts with lysine acetyltransferase 
2A (KAT2A) (21). The KGDH-coupled KAT2A 
succinylates histone 3 on lysine 79 (H3K79) in 
the proximal promoter regions of a subset of 
genes using the succinyl-coenzyme A gener- 
ated locally by KGDH-dependent a-KG decar- 
boxylation to promote gene expression in the 
cancer cells (27). In this study, we found that in 
Arabidopsis, the nuclear KGDH, in complex 
with various JMJs, regulates genome-wide 
gene expression primarily by inhibiting his- 
tone demethylations by JMJs. It will be interest- 
ing to explore whether the nuclear KGDH may 
couple with an Arabidopsis homolog of KAT2A 
to mediate H3K79 succinylation at a subset of 
loci, in addition to its general role in regulating 
histone demethylation activities of JMJs. 

In summary, we have uncovered that the 
TCA cycle-associated enzyme KGDH enters 
the nucleus in response to environmental 
signals. Nuclear KGDH interacts with various 
JMJs, catalyzes oxidative decarboxylation of 
a-KG, and thus inhibits o-KG-dependent his- 
tone demethylations by JMJs to regulate 
genome-wide gene expression in plants. The 
association of nuclear KGDH with JMJs occurs 
in mammalian cells, suggesting that KGDH 
may function together with JMJs to control 
histone demethylation and gene expression in 
mammals in addition to plants. 


Materials and Methods 
Plant materials and growth condition 


Arabidopsis mutants and transgenic lines used 
in this study are in the Columbia (Col-0) 
background. elf6-3, ref6-1, and jmjl4 were 
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described previously (8, 55). ogdh1-1 (Salk_072348), 
ogdhi-2 (Salk_002473), e2a-1 (Salk_134088), 
and e2b-1 (Salk_005851) were obtained from 
the Arabidopsis Biological Resource Center. 
Arabidopsis plants were grown in long days 
(LDs, 16 hours of light, 8 hours of dark) under 
cool white light with a relative humidity of 
~60% at 22°C. Nicotiana benthamiana plants 
were grown in LDs at ~25°C. 


Anthocyanin extraction and measurement 


Anthocyanins were extracted and measured as 
described previously (56). Briefly, 11-day-old 
seedlings grown on half-strength Murashige 
and Skoog medium were harvested, followed 
by anthocyanin extraction with 1% HCl in 
methanol (v/v). Subsequently, 0.5 volume of 
deionized water and 1 volume of chloroform 
was added to remove chlorophyll. After centri- 
fugation at 13,000g for 5 min, the supernatant 
was collected for spectrophotometric quanti- 
fication at 530 and 657 nm. Relative content of 
anthocyanins was calculated by (A530 - A657)/g 
(fresh weight). Samples were analyzed in three 
biological replicates. 


Metabolic profiling by liquid chromatography 
tandem mass spectrometry 


Measuring of primary metabolites in seedlings 
by liquid chromatography-tandem mass spec- 
trometry (LC-MS/MS) were conducted as pre- 
viously described with minor modifications 
(57, 58). Briefly, seedlings were harvested and 
ground into fine powder in liquid nitrogen, 
followed by metabolite extraction. For each 
sample, ~60 mg of powder was dissolved in 
800 ul of extraction buffer (7 methanol: 3 
chloroform, v/v, precooled at -20°C), and 
incubated at -20°C for 3 hours; subsequently, 
560 ul of double-distilled water (ddH20) was 
added, and 800 ul of supernatant was collected 
after centrifugation at 2200g for 10 min at 4°C, 
followed by another around of extraction with 
800 ul of buffer (1 methanol: 1 ddH20, v/v, 
precooled at 4°C). A total of 1.6 ml of super- 
natant per sample was filtered with a 0.2-uM 
nylon filter; subsequently, 1 ml was taken for 
MS/MS analysis, and 20 ul was used for 
quality control. The extractions were carried 
out on ice. 

For the LC analysis, a Luna NH2 column [3 pm 
(particle), 100 mm (length) x 2 mm (inner di- 
ameter); Phenomenex)] was used. The LC 
gradient was set with eluent A (10 mM ammo- 
nium acetate and 5% v/Vv acetonitrile solution, 
adjusted to pH 9.5 with ammonia water) and 
eluent B (acetonitrile) as follows: 0 to 1 min, 
15% A; 1 to 8 min, 15 to 70% A; 8 to 20 min, 70 
to 95% A; 20 to 22 min, 95% A; and 22 to 
25 min, 15% A. For the MS analysis, QTRAP 
6500+ (AB Sciex) was used in the MRM model, 
and all parameters were derived from previous 
studies with optimization (57, 58). The con- 
centration of each metabolite was calculated 
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using a “concentration-peak area” curve of stan- 
dard samples and presented as nanomoles per 
gram fresh weight. 


Plasmid construction 


To construct OGDHI-GUS and OGDH2-GUS, 
the genomic fragment of OGDHI (the 2-kb pro- 
moter plus the 1.8-kb coding sequence) and 
OGDH2 (the 1.4-kb promoter plus the 1.9-kb 
coding sequence) were fused upstream with 
the coding region for GUS in pMDCI162 (59), 
using Gateway technology (Invitrogen). To 
create pOGDH1-OGDHI1: HA, a genomic OGDH1 
fragment consisting of 2.0-kb promoter plus 
the 3.8-kb entire genomic coding sequence 
(without the stop codon) was first fused with 
a 3x HA tag sequence, which was subsequently 
cloned into pHGW (60) using Gateway tech- 
nology. To create pOGDHI-OGDHI:GFP, the 
5.8-kb genomic OGDHI fragment was fused 
upstream with the coding region for GFP in , 
pMDC1I0 (59) using Gateway technology. For 
pOGDH2-OGDH2:GFP construction, a genomic 
OGDH2 fragment consisting of 1.4-kb promoter 
plus the 4.3-kb entire genomic coding sequence 
(without the stop codon) was fused upstream 
with GFP in pMDCIIO. 

To construct p35S-OGDHI:Flag and p35S- 
OGDH2:Flag, the full-length genomic coding 
sequences for OGDHI (3.8 kb) and OGDH2 
(4.3-kb) without the stop codon were first 
fused with a 3x Flag, which were subsequently 
cloned downstream of the 35S promoter in 
pMDC32 (59). To create p35S-OGDH2:SV40yz5- 
Flag, the full-length OGDH2 genomic coding 
sequence (4.3 kb, without the stop codon) was 
first fused with the nuclear location signal of 
Pro-Lys-Lys-Lys-Arg-Lys-Val from SV40 (50); sub- 
sequently, a 3x Flag tag sequence was added to 
the fusion (at 3’ end), which was then cloned 
downstream of the 35S promoter in pMDC32. 

To create pE2a-E2a:GFP, a genomic frag- 
ment of E2a consisting of 1.6-kb promoter plus 
3.2-kb genomic coding sequence (without stop 
codon) was fused with GFP in pMDC110. For 
pE2a-mtE2a-1:GFP and pE2a-mtE2a-2:GFP con- 
structions, the 4.8 genomic fragment of £2a 
bearing mutations for E2a-1 (R236A, R237A, 
R242A and R244A) or mutations for E2a-2 
(K221A, S223A, K232A and R234A), was fused 
upstream with GFP in pMDCZI0. To construct 
p35S-E3a:HA, the full-length E3 genomic se- 
quence (2.1 kb, without the stop codon) was 
first fused with a 3xHA tag, and the fusion was 
subsequently cloned downstream of the 35S 
promoter in pMDC32. 

To construct pREF6-REF6:HA, a genomic 
fragment of REF6 consisting of 1.7-kb pro- 
moter plus the 5.2-kb entire genomic coding 
sequence (without the stop codon) was first 
fused with a 3xHA tag, and then the fusion 
was cloned into pHGW (60). For pELF6- 
ELF6:GFP construction, a genomic fragment 
of ELF6 consisting of the 0.7-kb promoter 
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plus the entire 5.5-kb genomic coding se- 
quence (without the stop codon) was fused 
upstream with GFP in pMDC1I0. To create 
pJMJ13-JMJ13:Flag, a genomic fragment of 
JMJ13 consisting of the 2.1-kb promoter plus 
the entire 5.5-kb genomic coding sequence 
(without the stop codon) was first fused with 
a 3x Flag tag sequence [as described in (75)], 
and then the fusion was cloned into pHGW. 
For p35S-HTA9:RFP construction, the full- 
length genomic sequence of HTA9 (encoding 
H2A.Z; 0.83 kb) without the stop codon was 
cloned into a modified pCAMBIA2300 vector, 
resulting in an HTAQ fusion with RFP. 

To construct Hs_OGDHI1:Flag and Hs_KDM4<A: 
HA, the coding regions of Hs_OGDHI and 
Hs_KDM4<A were fused with 3x Flag and 3x HA, 
respectively, and the fusions were subsequently 
cloned into pcDNA3.1 (GenScript Biotech) for 
transient expression in mammalian cells. For 
p35S-GST:Flag and p35S-GFP:HA constructions, 
GST:3xFlag and GFP:3xHA were cloned down- 
stream of the 35S promoter in pB2GW7 (60). 


Cellular protein fractionation 


Cellular fractionation assays were conducted 
as described previously (67). Briefly, seedlings 
were ground into fine powder in liquid nitro- 
gen and lysed in a fractionation lysis buffer 
[20 mM Tris-HCl, pH 7.5, 2 mM EDTA, 2.5 mM 
MgClo, 20 mM KCl, 5 mM dithiothreitol 
(DTT), 25% glycerol, 250 mM sucrose, and 1x 
Roche protease inhibitor cocktail], followed 
by centrifugation at 1500g for 10 min at 4°C. 
Subsequently, cytoplasmic proteins from the 
supernatant and nuclear proteins from the 
pellet were extracted, followed by immuno- 
blotting. The cytoplasmic protein marker Actin 
and nuclear protein marker histone 3 (H3) 
were detected with anti-Actin (Agrisera, AS132640) 
and anti-H3 (Millipore, 07-690), respectively. 
Nuclear fractions were mitochondrion free 
(fig. S6C). 


Yeast two-hybrid assay 


Yeast two-hybrid assays to examine protein- 
protein interactions were conducted using the 
Matchmaker Gold System (Clontech). Coding 
sequences of genes of interest were cloned 
into pGADT7 or pGBKT7. Paired plasmids 
were introduced into the yeast strain AH109. 
Subsequently, the transformed yeast cells were 
spread on solid dropout medium without 
leucine/tryptophan/adenine/histidine (-L/W/ 
A/H) to determine protein-protein interactions. 


BiFC assay 


The full-length coding sequences for OGDH1 
and ELF6 were fused in frame with an N- 
terminal and a C-terminal EYFP fragment, 
respectively; subsequently, this pair of plasmids 
together with the nuclear marker plasmid of 
HTA9:RFP were introduced into N. benthami- 
ana epidermal cells by Agrobacterium-mediated 
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transient transformation as described previously 
(62). Within 48 hours after inoculation, the 
fluorescence of EYFP and RFP was imaged 
with a Leica TCS SP8 SMD microscope. 


Co-IP 


Co-IP experiments with Arabidopsis seedlings 
or N. benthamiana leaves expressing proteins 
of interest were conducted as previously de- 
scribed with minor modifications (63). Briefly, 
Arabidopsis seedlings were first treated with 
10 uM MG132 (Sigma, M8699-IMG) for 3 hours, 
and subsequently harvested for total protein 
extraction. The extracts were immunoprecipi- 
tated with anti-HA beads (Sigma, SAE0197), 
anti-Flag beads (Sigma, M8823), or anti-GFP 
beads (Abcam, ab193983), followed by Western 
blotting to examine the presence or abundance 
of the proteins of interest in the precipitates. 

HEK 293F cells (64) were cultured and 
transfected using polyethylenimine as described 
previously (65). Transfected cells were lysed in 
the NETN buffer (100 mM NaCl, 1 mM EDTA, 
20 mM Tris-HCl, pH 8.0, and 0.5% Nonidet 
P-40), followed by total protein extraction as 
described previously (66). Subsequently, the 
extracts were immunoprecipitated with anti- 
HA beads (Sigma, SAEO197), followed by 
Western blotting. 


Histone extraction and immunoblotting 


Total histones were extracted and analyzed as 
previously described (67). Briefly, histones were 
extracted from 11-day-old seedlings, followed 
by immunoblotting with anti-H3K4mel (Abcam, 
ab8895), anti-H3K4me2 (Millipore, 07-030), anti- 
H3K4me3 (Millipore, 05-745), anti-H3K9me2 
(Millipore, D5567), anti-H3K27me2 (Millipore, 
07-452), anti-H3K27me3 (Millipore, 07-449), 
anti-H3K36me3 (Millipore, ABE435), and anti- 
H3 (Millipore, 07-690). Blotting signals or band 
intensities were quantified using ImageJ 
software. 


RNA analysis by reverse transcription coupled 
with qPCR 


Total RNAs were extracted from about 11-d- 
old seedlings using RNeasy Plus mini kit 
(Qiagen), following the manufacturer’s instruc- 
tions, and reverse transcription (RT)-qPCR 
assays were performed as previously described 
(67). Briefly, 2 ug of total RNA was reversely 
transcribed to synthesize cDNAs, followed by 
real-time qPCR on an ABI QuantStudio real- 
time PCR system using a master mix of SYBR 
Green (Bio-Rad). The constitutively expressed 
TUB2 served as an internal control. Three 
biological replicates were conducted for each 
sample, and primers are listed in table S5. 


RNA-seq analysis 


RNA-seq analysis was conducted as previously 
described (67). Briefly, total RNAs were extracted 
from seedlings using the RNeasy Plus mini kit 
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(Qiagen), followed by cDNA library construction 
with the NEBNext Ultra Directional RNA 
Library Prep kit for lumina (New England 
BioLabs, E7420L). cDNA libraries were se- 
quenced on an Illumina NovaSeq6000 se- 
quencer. Three biological replicates for each 
sample were conducted. RNA-seq data were 
processed as previously described (67), and 
DEGs were identified with the edgeR package 
(FDR < 0.05) (68). 


Immunoprecipitation of nuclear 
OGDH/KGDH complex 


About 1 g of seedlings expressing OGDH1:Flag 
were harvested, followed by nuclear isolation 
as previously described (69) with minor mo- 
difications. Briefly, frozen seedlings were ground 
into fine powder and dissolved in ice-cold nuclei 
isolation buffer [20 mM HEPES, pH 8, 250 mM 
sucrose, 1 mM MgCly, 5 mM KCl, 40% glycerol, 
0.25% Triton X-100, 0.1 mM phenylmethylsul- _ 
fonyl fluoride (PMSF), and 0.1% 2-mercapto- 
ethanol], followed by filtering through Miracloth 
and centrifugation to separate nuclei from tissue 
debris. After several washes with ice-cold nuclei 
isolation buffer, intact nuclei were collected, 
followed by total nuclear protein isolation 
using an extraction buffer (50 mM Tris-HCl, 
pH 7.5, 150 mM NaCl, 5 mM EDTA, 0.2% 
Triton X-100, 0.2% Nonidet P-40, 1 mM PMSF, 
and 1x Roche protease inhibitor cocktail). 
Nuclear KGDH complexes were immuno- 
precipiated with anti-Flag beads (Sigma, M8823), 
followed by several washes with a washing 
buffer (20 mM Tris-HCl, pH 7.5, 150 mM NaCl, 
8.0% glycerol, and 1 mM DTT). 


Measurement of KGDH activity 


The enzymatic activity of nuclear KGDH was 
measured using the o-Ketoglutarate Dehydro- 
genase Activity Colorimetric Assay Kit (Sigma, 
MAK189) according to the manufacturer’s 
instructions with minor modifications. Briefly, 
the anti-Flag beads carrying OGDH1:Flag/ 
KGDH and mock (precipitates from WT by 
anti-Flag beads) were incubated with the 
KGDH assay buffer, KGDH developer, and 
KGDH substrate from the kit. The absorbance 
of A450 was measured with a Varioskan Flash 
microplate reader (Thermo Scientific) at dif- 
ferent time points of reaction at 37°C. The 
amount of NADH (nanomoles/well) gener- 
ated by the KGDH holoenzyme between time 
point 1 and time point 2 was calculated ac- 
cording to a standard curve. Total proteins per 
reaction were measured with a Bradford pro- 
tein assay kit (Sangon Biotech) according to 
the manufacturer’s instructions. KGDH activ- 
ity was calculated as nanomoles of NADH pro- 
duced per microgram of protein per minute. 
To measure KGDH activity in total nuclear 
proteins, nuclei were first extracted from 
about 1 g of seedlings of WT, e2a-1, and ogdh1/2 
as described previously (69). Subsequently, the 
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nuclei resuspended in the KGDH assay buffer 
sample were sonicated, followed by centrifuga- 
tion at 10,000g for 5 min. Total nuclear proteins 
in the supernatants were used for o-KG de- 
carboxylation and KGDH activity assays. 


Histone demethylation assay 


The histone demethylation activities of the 
nuclear OGDH/KGDH-associated JMJs were 
measured as previously described (55). Briefly, 
the anti-Flag beads carrying OGDH1-Flag/KGDH 
and mock were incubated with calf thymus 
histones (Sigma) in an assay buffer [20 mM 
Tris/HCl pH 7.5, 150 mM NaCl, 50 uM (NH4) 
2Fe(SO4)2, 1 mM o-ketoglutarate, and 2 mM 
ascorbic acid] for 4.5 hours at 37°C. In sepa- 
rate reactions, succinate (10 mM) was incubated 
briefly with the OGDH1-Flag immunoprecipi- 
tates before adding other reaction mixtures 
(49). The histones recovered from the reactions 
were separated by SDS-polyacrylamide gel 
electrophoresis and transferred to 0.2-um 
nitrocellulose membranes, followed by immu- 
noblotting with anti-trimethyl H3K4 (Milli- 
pore, 05-745), anti-monomethyl H3K4 (abcam, 
ab8895), anti-trimethyl H3K27 and anti-mono- 
methyl H3K27 (Millipore, 07-448), or anti-H3 
(Millipore, 07-690). 


ChIP-seq 


ChIP-seq experiments to analyze genome-wide 
occupancy of OGDH1:Flag were performed as 
described previously with minor modifications 
(70). Briefly, 11-day-old seedlings expressing 
OGDH1:Flag (pretreated with 10 uM MG132) 
were fixed with 1% formaldehyde in a fixation 
buffer (0.4 M sucrose, 10 mM Tris-HCl, pH 8.0, 
1 mM EDTA, and 1 mM PMSF), followed by 
nucleus isolation as described previously (69). 
Subsequently, nuclei were resuspended in a 
lysis buffer, followed by chromatin extraction 
and sonication. OGDHI1:Flag-bound chromatin 
was immunoprecipitated with anti-Flag beads 
(Sigma, M8823). After de-cross-linking, DNA 
fragments were purified with the QIAquick 
PCR purification kit. DNA libraries for se- 
quencing were constructed using an NEBNext 
UltraTM II DNA Library Prep Kit for Iumina 
(New England BioLabs), and sequenced on an 
Illumina NovaSeq6000 platform. Two biologi- 
cally independent samples were analyzed. 

ChIP-seq analysis of H3K27me3 in WT and 
ogdh1 ogdh2 was conducted as described pre- 
viously (70). Briefly, WT and ogdhI ogdh2 
seedlings were fixed, followed by nucleus iso- 
lation. Total chromatin was extracted and 
sonicated, followed by immunoprecipitations 
with anti-H3K27me3 (Millipore, 07-449). After 
de-cross-linking, DNA fragments were puri- 
fied; subsequently, DNA libraries for sequenc- 
ing were constructed and sequenced on an 
Illumina NovaSeq6000 platform. Three bio- 
logically independent samples were analyzed 
for each genotype. 
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Processing of ChIP-seq data 

Raw sequencing data of OGDH1:Flag ChIP 
were cleaned with trim_galore (version 0.4.4) 
(https://github.com/FelixKrueger/TrimGalore), 
and adapters and reads with quality lower than 
30 and length shorter than 50 bp were filtered. 
Reads were mapped to the TAIR1O refer- 
ence genome using bowtie2 (version 2.2.9) 
(71) with default parameters, and the map- 
ping files were filtered by samtools (version: 
0.1.19) (72). Peaks were called using MACS2 
(parameters:-gsize 1.19e8-qvalue 0.01) (73). 
The coverage calculation of alignments was 
performed using bamCoverage from deepTools2. 
To profile peak distribution, computeMatrix 
(parameters: -a 1000 -b 1000 -m 2000) and 
plotProfile (parameters:-plotHeight 8-plotWidth 
12 -perGroup) from deepTools were used. To 
identify the loci where the peaks are located, 
ChIPSeeker (74) was used to annotate the 
peaks, and the genes with peaks were visual- 
ized by the WashU Epigenome Browser (75). 
In addition, the heatmaps of peaks were 
plotted using plotHeatmap from deepTools. 
To examine the consistency between replicates, 
the average scores of bam files were obtained 
using multiBigwigSummary from deepTools2 
(version version 3.5.1) (76), and correlation 
analysis was performed with plotCorrelation 
from deepTools (parameters:-corMethod 
pearson—skipZeros). 

ChIP-seq data of H3K27me3 were processed 
and analyzed as previously described (70). 
Enriched peaks were called by MACS with the 
cut-off P value of 0.05, and read coverage per 
1x sequencing depth (reads per genomic content) 
or reads per million (at single-nucleotide resolu- 
tion) over target loci were scored. The reads per 
million score differences of H3K27me3 enrich- 
ment between ogdhI/2 and WT at the examined 
loci were calculated with computeMatrix from 
deepTools and visualized using plotHeatmap 
from deepTools. 


ChIP-qPCR 


ChIP assays were performed as described 
previously (70) with minor modifications. Briefly, 
the seedlings expressing OGDH1:HA, OGDH1: 
Flag, OGDH2:GFP, JMJ14:Flag, or ELF6:GFP 
were fixed with 1% formaldehyde in the 
fixation buffer, followed by nucleus isolation 
as described previously (69). Total chromatin 
was extracted and sonicated, followed by 
immunoprecipitations with anti-HA (Sigma, 
H9658), anti-Flag beads (Sigma, M8823), or 
anti-GFP (Abcam, ab290). After de-cross-linking, 
DNA fragments was purified with the QIAquick 
PCR purification kit. Subsequently, DNA frag- 
ments of interest were quantified on an ABI 
QuantStudio Real-Time PCR System using a 
master mix of SYBR Green (Bio-Rad). Three 
biologically independent samples were ana- 
lyzed in each experiment, and each sample was 
quantified in triplicate. Primers are listed in 


14 July 2023 


table S5. Relative enrichment of a protein of 
interest at each examined region was first nor- 
malized to the endogenous control TUB2, 
followed by normalization to a background 
control (WT). 
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INTRODUCTION: As part of their life cycle, viruses 
must escape from their hosts. For bacteriophages, 
this process requires breaching the bacterial cell 
wall and the peptidoglycan layer. Small, single- 
strand DNA and RNA phages have evolved single- 
gene lysis proteins that disrupt peptidoglycan 
biosynthesis and trigger cell lysis, a mechanism 
also used by some antibiotics. The best-studied 
of these is protein E from the historically impor- 
tant phage ®X174, a 91-amino acid protein that 
contains a conserved N-terminal transmembrane 
helix and an extended cytoplasmic C-terminus. 


RATIONALE: Mutagenesis studies have dem- 
onstrated that the protein E transmembrane 
domain is sufficient for lysis, with functional 
mutants mapping to one face of the single 
transmembrane helix. The strongest pheno- 
types were observed for conserved prolines in 
this domain of protein E, with one proline being 
essential for lysis. Wild-type protein E requires 
the constitutively expressed bacterial chaper- 
one SlyD (sensitivity to lysis D), which contains 
a prolyl isomerase domain and a chaperone 
domain. Previous work had identified mutants 


The escape of ©X174 from its bacterial host. In the membrane is the YES complex [E. coli enzyme MraY 
(cyan), phage protein E (yellow), and E. coli chaperone SlyD (purple)] where protein E disrupts peptidoglycan 
synthesis by inhibiting MraY and allowing breaching of the cell wall (tan). Assembled ®X174 phage particles 
are shown in gray. In the background are bacterial cells that are lysing at their septal division point. 
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in MraY that confer resistance to proteir ie 
mediated lysis. MraY catalyzes the synth 2-2 
of a peptidoglycan precursor from a soluble 
nucleoside-sugar-peptide and a phospholipid 
substrate. Although many functional ques- 
tions remain about MraY, here we sought to 
understand how a phage protein could spe- 
cifically inhibit this key enzyme and what role 
SlyD performed in the process. 


RESULTS: We established that phage protein E 
forms a stable complex with the Escherichia 
coli proteins MraY and SlyD, designated the 
YES complex (MraY, protein E, SlyD). After 
solubilizing in detergent, we determined struc- 
tures of the YES complex by single-particle elec- 
tron cryo-microscopy. In our structures, the 
MraY dimer adopts a back-to-back orientation 
with the membrane-exposed active sites facing 
away from each other. Two SlyD molecules can 
be fit into the density on the cytoplasmic side. 
The two MraY and SlyD chains are bridged by 
two protein E molecules. The transmembrane 
helix of protein E occupies a groove on MraY 
corresponding to the putative binding site of the 
lipid substrate. At the cytoplasmic surface, the 
transmembrane domain of protein E bends to 
cross the active site, followed by an a-helix that ‘ 
extends to the chaperone domain of SlyD. The C- 
terminus of protein E continues through a 
peptide-binding groove in the second SlyD and 
into the prolyl isomerase active site. The conse- * 
quence is that protein E inhibits MraY by block- 
ing lipid access to the active site. Previous protein 
E mutant phenotypes are explained by the struc- 
ture including the essential proline that allows for 
a kink in the transmembrane helix. For MraY, we 
can resolve the full chain including the conserved 
loop between TM] and TM2. We show that the 
N terminus of MraY forms an o-helical stack 
with TM2 and identify ordered lipids on the 
protein surface. Finally, we provide evidence < 
that the role of SlyD is to stabilize the complex. ‘ 


CONCLUSION: Protein E directly inhibits MraY 
by obstructing the MraY active site in a stable . 
complex with SlyD. This structure resolves key 
questions about how the model phage ®X174 
kills bacteria and escapes the cell. In the BX174 
genome, the gene encoding for protein E is 
evolutionarily constrained by gene D, in which 
it is embedded. The YES complex provides a 
route for the rational design of protein E beyond 
this gene D restraint. Single-gene lysis proteins, 
like protein E, serve as useful models for the 
development of antibacterial therapies. = 


The list of author affiliations is available in the full article. 
*Corresponding author. Email: clemons@caltech.edu 
tThese authors contributed equally to this work. 
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ANTIMICROBIAL PROTEINS 


The mechanism of the phage-encoded protein 


antibiotic from @X174 


Anna K. Orta‘, Nadia Riera’+, Yancheng E. Li’, Shiho Tanaka’, Hyun Gi Yun’, 


Lada Klaic’, William M. Clemons Jr.)* 


The historically important phage ®X174 kills its host bacteria by encoding a 91-residue protein 
antibiotic called protein E. Using single-particle electron cryo-microscopy, we demonstrate that protein 
E bridges two bacterial proteins to form the transmembrane YES complex [MraY, protein E, sensitivity 
to lysis D (SlyD)]. Protein E inhibits peptidoglycan biosynthesis by obstructing the MraY active site 
leading to loss of lipid | production. We experimentally validate this result for two different viral species, 
providing a clear model for bacterial lysis and unifying previous experimental data. Additionally, we 
characterize the Escherichia coli MraY structure—revealing features of this essential enzyme—and the 
structure of the chaperone SlyD bound to a protein. Our structures provide insights into the mechanism 
of phage-mediated lysis and for structure-based design of phage therapeutics. 


he full realization of phage therapy as 

a solution to the antimicrobial resist- 

ance problem requires a fundamental 

understanding of the mechanisms vi- 

ruses use to kill their host (/-4). Nearly 
100 years ago, the first medical application 
of phages to treat infections used a cocktail 
containing the historically important phage 
®X174 (5). A rich source of critical discov- 
eries in molecular biology (6-9), ®X174 is a 
member of the Bullavirinae subfamily within 
Microviridae. It is found broadly in environ- 
ments that contain coliform bacteria, such as 
the human gut (J0), with Escherichia coli as 
its primary host. The 1977 publication of the 
®X174 single-stranded 5.4-kilobase DNA ge- 
nome was a milestone for genomics, reveal- 
ing 11 open reading frames (ORFs) (17). Most 
notable was the gene for lysis, EF, that is em- 
bedded within the ORF of the scaffolding gene 
D (12). Expression of F alone is sufficient to kill 
bacteria (73). Although a variety of lysis mech- 
anisms have been proposed (J4-16), the most 
likely is that the product of EF, protein E, disrupts 
peptidoglycan (PG) biosynthesis, culminating 
in a breach in the cell wall. In particular, pro- 
tein E was demonstrated to directly inhibit the 
integral membrane enzyme MraY (16) depen- 
dent on the cytoplasmic chaperone SlyD (sen- 
sitivity to lysis D) (17, 18). 

Phage-derived single-gene lysis (SGL) pro- 
teins, such as protein E, trigger cell lysis by 
inhibiting PG biosynthesis [reviewed in (19)] 
providing a route for killing bacteria. Protein 
E includes a conserved N-terminal transmem- 
brane domain (TMD), a cytoplasmic C-terminal 
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domain with conserved positively charged 
residues in a predicted amphipathic helix, 
and an unstructured tail (Fig. 1A and fig. S1). 
Fusions of globular proteins to the C termi- 
nus of the TMD have demonstrated that the 
TMD is sufficient for lysis (17, 20, 21). Exten- 
sive mutational analysis identified key res- 
idues in protein E including the essential 
proline 21 (P21) (17, 20-23). The mechanism 
through which SlyD and MraY work with 
protein E to facilitate rupture of the cell wall 
remains unknown. MrayY, an important target 
for antimicrobial drug discovery [reviewed 
in (24)], catalyzes the transfer of a phospho- 
MurNAc-pentapeptide from “Park’s nucleotide” 
(UDP-MurNAc-pentapeptide) onto the phos- 
phate group of the 55 carbon polyisoprenyl- 
phosphate (C;;P) generating lipid I. X-ray 
crystal structures of MraY in detergent from 
the thermophilic Gram-negative Aquifex aeolicus 
(AaMraY) (25, 26) or the Gram-positive Enterocloster 
bolteae (EbMraY) (27) revealed a conserved 
homodimeric structure, with ten TMDs per 
subunit. The structures localized the active 
site in a vestibule on the cytoplasmic side of a 
membrane-exposed cleft formed by TMDs 4, 
5, and 9, which is the predicted access site for 
the lipid substrate (28). The constitutively 
expressed but nonessential (29) metallocha- 
perone (30) SlyD contains two conserved core 
domains: an FK506-binding protein (FKBP) 
prolyl-isomerase domain and an insertion-in- 
flap (IF) domain that binds unfolded peptides 
by B-augmentation (18, 31-33). The disordered 
SlyD C terminus (37), although important for 
metal binding, can be deleted without affect- 
ing chaperone activity (34). 

In the present study, we resolve the mech- 
anism of peptidoglycan biosynthesis inhibition 
by protein E. We determined the structure of 
the dimeric heterotrimer YES complex (ZcMraY, 


viral protein E, EcSlyD) revealing that protein 
E physically blocks the lipid substrate from 
accessing the MraY active site. This work 
provides mechanistic insight into all three 
proteins and suggests a path toward devel- 
opment of antibacterial agents. 


The structure of the YES complex 


We used the protein E sequence from either the 
original phage ®X174 or the shorter protein E 
isoform from phage ID21 (91 and 76 residues 
respectively), both from within Bullavirinae 
(Fig. 1A and fig. SI). We coexpressed an affinity- 
tagged protein E@x174 and wild-type (WT) 
EcMrayY. After purification, a stable complex 
formed a single peak during size-exclusion 
chromatography (SEC) but resolved into four 
bands on a gel, two of which corresponded to 
the endogenous EcSlyD (fig. S2A). A SlyD variant 
with the disordered C terminus removed res- 
cued lysis activity in an E. coli AslyD strain (fig. 
S2, B to D) and ran as a single band on a gel (Fig. 
1B) (8, 30). This truncation, SlyD,;,, was used 
for subsequent work except where noted. 

The two protein E isoforms induced lysis at 
similar efficiencies when expressed in a WT 
E. coli strain (Fig. 1C). For purification, all three 
genes in the YES complex (WT EcMraY, protein 
E, and truncated EcSlyD) were recombinantly 
expressed together in the AslyD strain with 
a C-terminal affinity tag on protein E. The 
complex was extracted in detergent and ran 
as a single peak by SEC with all three proteins 
in an apparent stoichiometric complex (Fig. 
1B). Structures for both the YES@xj174 and 
YESjp21 complexes were solved using single- 
particle electron cryo-microscopy (cryo-EM) 
(Fig. 1D and figs. S3 and S4). The final density 
maps were obtained following several rounds 
of data processing with heterogeneous and ho- 
mogeneous refinements (figs. S3 and S4). The 
final overall masked resolution was 3.5 A for 
the YES;p2; complex and 3.6 A for the YES@xi7. 
complex (fig. S5). Statistics are provided in 
table S2. For both structures, the resolution 
was higher for regions in and adjacent to the 
membrane (figs. S3 and S4). We unambiguously 
built 90% of the protein residues in the complex. 
Sequence differences between the protein E 
variants are visible in the density (fig. S5), gen- 
erally exposed on the surface of the complex. 
The YES;p2; complex is used as the reference 
structure, except where noted. 

Within the density map, we could clearly 
distinguish two copies of each member of the 
YES complex (six separate proteins). When 
contoured to remove the detergent micelle, 
densities for 22 TMDs are clear, 20 of which 
are accounted for by the EcMraY dimer (Fig. 
1, D and E). Most of the cytoplasmic density 
can be accounted for by two SlyD molecules, 
which is the most flexible region of the com- 
plex (fig. S6D). The remaining protein den- 
sity corresponds to two protein E molecules 
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Fig. 1. The structure of the YES complex. (A) An alignment of a representative 
subset of protein E isoforms. Residue coloring is based on ClustalW. Secondary 
structure elements are shown above the sequence. Sequences are ordered as in 
fig. Sl. Residues highlighted with a (*) at the bottom of the alignment are discussed 
in the text. (B) SDS-PAGE gel of the purified YESjp2; complex. (C) A lysis assay 
for protein E expression. Cells containing either empty vector (black line) or the 
protein E genes for ID21 (pink line) or @X174 (green line) were induced at time 0 and 
the absorbance at 600 nm was monitored over time. Error bars represent the 
standard deviation derived from n = 3. (D) Overview density maps of the YESip21 


complex viewed in the plane of the membrane. The detergent micelle is highlighted 
by gray coloration. The higher contoured map shows the six components of the 
twofold complex with density for E. coli MraY (cyan), protein E (yellow), and E. coli 
SlyD (purple) highlighted. The pairs of each protein are distinguished as the 
B-subunit is light purple. The general coloring scheme is maintained throughout 
the figures. Density that is likely lipid is shown in transparent orange. (E) Illustrated 
representation of the YESip21 complex oriented and colored as in (D). Foreground 
TMDs for the MraYs are numbered and the bilayer is represented by lines. 

N terminus and C terminus of protein E, are labeled. 


that each contain a kinked TMD and a solu- 
ble domain that bridges between MraY and 
SlyD. Densities for lipids are visible around 
the membrane-exposed surface of the MraY 
dimer (Fig. 1D). 

For protein E, starting at the N terminus 
in the periplasm, the TMD binds in the groove 
formed between TM5 and TM9 of MraY end- 
ing on the cytoplasmic side where it makes a 
sharp turn into the active site pocket (Figs. 1E 
and 2A). The TMD is followed by an amphi- 
pathic helix that crosses the active site, parallel 
to the membrane, presenting a positive face 
toward MraY and a hydrophobic face toward 
a SlyD IF-domain. The C-terminal residues 
adopt an extended conformation that pri- 
marily interacts with the second SlyD. This 
results in a crossover point between the two 
protein Es with each contacting both SlyDs. 
Overall, the dimeric complex (two of each of 
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the heterotrimers) has a near twofold sym- 
metry perpendicular to the plane of the mem- 
brane. The membrane and periplasmic facing 
regions overlay perfectly but the symmetry 
is broken at the cytoplasmic face where the 
C-termini of the two protein E molecules 
cross each other at different residues and the 
SlyDs adopt slightly different orientations 
(Fig. 1E and fig. S6). Following toward the end 
of protein E, we can see continuous backbone 
density that positions proline 65 in the ac- 
tive site of the FKBP domain. Beyond that, the 
density is insufficient to resolve the sequence 
and we see little difference between ®X174 and 
1D21 (fig. S5). 


The interaction of protein E with MraY 


Functional studies have consistently revealed 
the requirement for a proline at position 21 
(21, 22, 35). Our structure allows an elegant 


explanation for this requirement. The pro- 
tein E TMD binds in the cleft of MraY that 
is defined by the angled TM9b (Fig. 2A). The 
proline at position 21 breaks the hydrogen 
bonding and introduces a kink that allows 
bending of the TMD around TM9b following 
the groove to the active site. Mutation of P21 
would result in loss of the kink favoring a 
straight TMD that could not bind in the groove. 
Residue P29, also completely conserved (Fig. 
1A and fig. SI), creates a second bend that com- 
pletes the wrap around TM9b and a mutation 
at this position results in delayed lysis onset 
(21, 22). Additional alanine mutations in the 
TMD identified residues that result in de- 
layed lysis onset, postulated to decrease the 
binding affinity of protein E to MraY (27). In 
the present structure, most of these residues 
(L19, L20, L23, and M26) (Fig. 2C) make direct 
contact with MraY. The exception, F27, appears 
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Protein E 


MraY, 


Fig. 2. The interaction of protein E with EcMraY. (A) Illustration of the 
YESjp21 complex as in Fig. 1E with a 90° rotation except that MraYa is color- 
ramped using the Viridis palette that goes from purple to yellow for the N- to 
C-terminus, respectively. Prolines are shown in red. The bilayer is modeled in 
white. The B-subunits and SlyDs are faded. Boxes indicate regions highlighted in 
panels (B to D). Side chains for protein E residues that contact MraY are shown 
as sticks. (B) as in (A) with positions of protein E resistance mutants in MraY 


to sterically position M26 into a tight inter- 
action with Y134 in MraY, which is conserved 
in most Gram-negative bacteria (fig. S7). 

At the cytoplasmic interface, protein E res- 
idues A36 through M50 form the amphipathic 
helix spanning the width of an MraY subunit. 
The hydrophilic face of this helix orients toward 
the membrane in the MraY active site. The helix 
contains conserved positively charged residues 
that interact with conserved negatively charged 
residues in MraY (Fig. 2D). An example is the 
K46 salt bridge where, in our lysis assay, a K46A 
mutation resulted in delayed lysis onset (Fig. 
2E). Here, isoform differences can be visualized. 
For example, position 42 is a leucine in ®X174 
whereas in ID21 it is an arginine that forms a 
salt bridge with E335 in MraY (Fig. 1). The helix 
ends at the crossover between the two protein 
Es at residue V54 in protein E, and A51 in 
protein Eg, resulting in differing interactions 
with residues in MraY, such as the essential 
H326 (Fig. 2D). 

The Epos (plaques on As/yD) mutations, R3H 
and LI19F, allow phage propagation in a AslyD 
background (23). The R3H mutant of protein 
E in ®X174 results in a silent mutation in pro- 
tein D and is found native to other species 
such as ID21 (fig. S8). Previous work reported 
that this variant resulted in higher levels of 
protein E in the membrane (23). In the present 
structure (fig. S5, G and H), this residue does 


not make specific contact with MraY. It is 
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likely that the loss of a positive charge in the 
periplasm favors a higher percentage of pro- 
tein E molecules that are correctly inserted in 
the membrane as a result of the positive-inside 
rule. L19F does not result in higher levels of 
protein E (27) and likely hastens lysis onset 
by having a higher affinity to MraY. Another 
phenylalanine mutation at the interface, L23F, 
also hastens lysis onset likely by higher affi- 
nity, although this is not a general rule as other 
leucine to phenylalanine mutants did not af- 
fect lysis (22). Both L19 and L23 are near the 
conserved F182 in MraY and may add addi- 
tional stability through aromatic x interactions 
(Fig. 2C). The opposite mutation, phenylala- 
nine to leucine, can show loss of binding. For 
example, the F288L mutation in MraY (J6) is 
located at the interface with protein E and re- 
sults in a loss of lysis, most likely due to lower 
affinity. 

All the mutations in MraY that allow re- 
sistance to protein E-mediated lysis (P170L, 
AL172, G186S, F288L, V291M) (6, 35) are at 
the interface with protein E (Fig. 2B). The 
mutant F288L, as noted above, lowers affi- 
nity to protein E. Residue G186 is at the near- 
est approach between the two proteins and a 
mutation to serine would prevent protein E 
binding. The mutant V291M lies directly at the 
interface near L19 in protein E, although a 
specific effect for this mutation is not clear. 
Finally, P170L and AL172 are located within 
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periplasmic loop 4-5, which interacts with the 
conserved N terminus of protein E. Although 
more resistant to lysis, these two mutants are 
predicted to still bind protein E, albeit with 
lower affinity (35). 


The mechanism of inhibition of MraY 
by protein E 


The YES complex structure allows us to pro- 
pose a simple mechanism for inhibition of 
MraY by protein E. Superposition with previ- 
ous MraY inhibitor complexes (26, 27) on the 
YES complex (Fig. 3) shows that the predicted 
path of the polyisoprenyl chain of C;;P is the 
groove formed between TM5 and TMQ, which 
is occluded by protein E. Therefore, one mech- 
anism of inhibition is that protein E prevents 
access of the lipid substrate to the active site 
(Fig. 3A). Protein E is a noncompetitive inhib- 
itor of Park’s nucleotide (36) and, accord- 
ingly, the pocket that binds the nucleoside is 
fully accessible in the present structure (Fig. 
3B). Loop 9-10 in MraY contains catalytic his- 
tidines that must move toward the binding 
pocket to facilitate catalysis by completing 
the active site (28). The cytoplasmic helix of 
protein E separates loop 9-10 from the rest of 
the active site, blocking this transition and 
providing a second mechanism of inhibition. 
Overall, protein E blocks access of the lipid 
substrate and prevents formation of the active 
site upon substrate binding. 
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Ocarbacaprazamycin 
@ tunicamycin 


Fig. 3. The mechanism of inhibition by protein E. 
(A) Accessible surface of MraY (colored as in Fig. 2A) 
viewed from the cytoplasm looking toward the active 
site cleft. Protein E is shown as an illustration. The 
structures for the inhibitors tunicamycin (maroon) and 
carbacaprazamycin (pink) are shown as sticks based 
on a structural alignment to EcMraY, with their 
respective complex structures [PDBID: 60YH (26) and 
5JNQ (27)]. (B) Similar to (A) from a slightly different 
angle highlighting the catalytic pocket. MraY (dark 
cyan) is shown as an illustration. The two substrate 
binding sites are highlighted by dashed boxes. Predicted 
catalytic residues in MraY are shown as sticks. 


Key structural features of MraY 

Although there are prior crystallographic MraY 
structures (25-27), our YES complex structure 
reveals FE. coli MraY in a distinct structural 
context (fig. S9). There is agreement with the 
prior structures when comparing MraY mono- 
mers with backbone RMSDs around 1 A (Fig. 4A 
and fig. S10). We can model previously disor- 
dered regions of MraY including all the cyto- 
plasmic loops that enclose the active site. Loop 
1-2 likely adopts distinct conformations during 
the catalytic cycle and indeed has two slightly 
different conformations in our dimer (fig. S6C). 
Loop 9-10 adopts a conformation that is simi- 
lar to the reported small-molecule inhibitor 
co-crystal structures of MraY (26, 27, 37) but 
distinct from the apoprotein structure (Fig. 4 
and fig. S10). This conformation of loop 9-10 
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RMSD 
e@EcMraY - 
e AaMrayY (6OYH)|0.966 
eAaMraY(4J72) | 1.081 
EbMraY(5JNQ) | 1.024 
* N-terminus 


2 Cytoplasm 


Fig. 4. Features observed in the EM structure of E. coli MraY. (A) Cytoplasmic view of a structural 
alignment of EcMraY (dark cyan) against uninhibited AaMraY [green, PDBID:4J72 (25)] and carbacaprazamycin 
inhibited AaMraY [pink, PDBID:60YH (26)]. RMSDs to monomer A of EcMraY are shown. The color scheme for 
the various MraY crystal structures is used throughout the figures. (B) A view in the plane of the membrane 
showing the region that includes TM1 and TM2 in backbone ribbons. (Left) The EcMraY structure colored 
from the N terminus through TM2 in the Viridis color scheme. (Right) The inhibited AaMraY structure 

from (A) and the EbMraY structure [gray, PDBID:5JNQ (27)]. Each structure is aligned to EcMraY. The location of 
each N terminus is indicated by an asterisk. (©) Accessible surface of EcMraY (cyan) and unmodeled 
densities (orange) that are likely lipids or detergent as well as a putative Co5P molecule (purple, see below). 
The inset is a view of the periplasmic cavity viewed from a removed monomer. 


may be a general feature of MraY complexed 
with inhibitors. 

In prior crystal structures, the N terminus of 
MraY begins at either TM1 or is a helix that 
projects away from the structure in an orienta- 
tion incompatible with the bilayer (Fig. 4B). In 
the YES complex, the N-terminal end of the first 
helix (NTH) hydrogen-bonds to the C-terminal 
end of TM2, effectively forming a helical stack- 
ing structure (Fig. 4B and fig. S9C). A multiple 
sequence alignment across bacteria shows 
that the NTH in MrayY is conserved across gram- 
negative bacteria but missing in Gram posi- 
tives (figs. S7 and S11). Although this feature is 
not found in the crystal structures, AlphaFold 
(38) predicts the NTH stacking for E. coli and 
other Gram-negative bacteria with slight dif- 
ferences in hydrogen bonding and orientation 
relative to the cryo-EM structure (fig. S11). For 
the AaMraY structures (25, 26), the positioning 
of the N terminus is likely a product of crys- 
tallization, as AlphaFold predicts the NTH stacks 
for this and the related Hydrogenivirga spe- 
cies (fig. S11). For Gram-positive bacteria, both 
the EbMraY and predicted structures lack the 
NTH (27) (Fig. 4B and fig. S11). 


Protein E as a general antibacterial protein 

Expression of protein E leads to lysis that 
results in a subset of killed bacteria becom- 
ing “ghosts”—essentially empty cell walls (5). 
Ghosts have been used in a variety of contexts 
including vaccine development [reviewed in 
(39)]. For ghosts to be made, protein E must 
inhibit the native MraY of the target bacteria. 
Although phage ®X174 is restricted to E. coli, 
many bacteria have been probed for ghost for- 
mation by expression of protein E. All Gram- 
negative bacteria tested (fig. SI1A) resulted in 
the formation of ghosts [reviewed in (40)]. By 
contrast, protein E was unable to cause lysis 
in the Gram-positive Staphylococcus carnosus 
(41). Expression of the Bacillus subtilis MraY 
in E. coli prevented cell lysis, suggesting that it 
has a low affinity for protein E (35). Compar- 
ing the residues in EcMraY that contact pro- 
tein E to those in Gram-positive species, there 
are no single residue differences that easily 
explain the inability to inhibit MraY from gram- 
positive species. There are a few residues that 
may play a role—such as P170 in EcMraY which 
confers resistance when mutated to leucine— 
that are absent in Gram-positive MraYs (fig. $7). 
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EcMraY residue Y134, which forms a bridge 
with protein E M26 in our model (Fig. 2C), is 
also missing in Gram-positive species (fig. S7). 


Lipids bound to the YES complex 


The YES complex was solubilized from its na- 
tive environment and, for the YES;p2; complex, 
the final detergent solution was supplemented 
with E. coli lipids. We observe lipid densities 
around the membrane surface of MraY (Fig. 
1D and fig. S12, A and B). Prior work supports a 
functional role for anionic phospholipids with 
MraY including stabilizing the dimer (42-44). 
We observe substantial lipid density near the 
dimer interface with features consistent with 
glycerophospholipids, although not all density 
can be clearly assigned (fig. S12). As in previous 
structures (25, 27), there is a hydrophobic peri- 
plasmic cavity within the MraY dimer that con- 
tains unexplained density (Fig. 4C). Although 
this density has clear structure, we are unable 
to fit typical E. coli phospholipids. 

Recently, a study using native mass spec- 
trometry and molecular dynamics identified 
a binding site for C;;P at the dimer interface 
with the phosphate headgroup predicted to 
form a salt bridge to R341 in EcMraY (44). In 
our structure, we observed a long tube of den- 
sity at the dimer interface (fig. S12A, purple) 
that ends at R341 and is consistent with C;;P 
(fig. S12C). Although we cannot confidently iden- 
tify this lipid, the density would be consistent 
with C;;P, although the role for this binding 
site remains to be determined. 


The role of SlyD in protein E-mediated lysis 

The amphipathic helix of protein E bridges the 
two E. coli proteins, MraY and SlyD, which 
make no specific contact with each other (Fig. 
2A). The IF domain sits on the hydrophobic 
face of the protein E helix (Fig. 5A) and con- 
tacts the extended C terminus of the oppos- 
ing protein E, which then extends to bind 
the FKBP domain (Fig. 5B). This results in a 
bowtie-like interaction with each SlyD binding 
both protein E-soluble domains (Fig. 1E). There 
are no contacts between the two SlyD mol- 
ecules (fig. S6D). These protein E interfaces 
validate structural studies of SlyD where pep- 
tides bind to each of the interfaces seen here 
including B-augmentation in a groove in the IF 
domain (Fig. 5, C and D) (32). The IF domain 
binding to an a-helix may be an important 
chaperoning interaction. The FKBP domain is 
well-ordered but adopts a range of orientations 
in our particles (fig. S13) and consequently has 
the lowest resolution (fig. S3). This flexibility 
relative to the IF domain is consistent with the 
NMR structure of SlyD where the orientations 
of the two domains are not constrained by each 
other (37). The only contact of the FKBP do- 
main to the rest of the YES complex is the 
flexible linker to the IF domain and binding to 
the extended C terminus of protein E. We ob- 
serve continuous backbone density that allows 
us to place P65 at the FKBP active site. As seen 
before for some SlyD-bound peptides (32), pro- 
tein E binding to the FKBP domain adopts a 
noncanonical orientation. Protein E P65 is not 


A\\) so peptide 


TtSlyD PDBID: 70x! 


Fig. 5. Interactions between protein E and SlyD. (A) The amphipathic helix of protein E (yellow) as a 
illustration with side chains contacting SlyD shown as sticks. SlyD (purple) is shown as an illustration along 
with a transparent accessible surface (B) Full view of SlyD bound to two protein E molecules shown in 
different shades of yellow. (C) As in (A) highlighting the B-augmentation of the extended C-terminal protein 
E. (D) Structures of SlyD from the YES;p2; complex and TtSlyD [PDBID:70X! (33)] (green) aligned to 

the IF domains. The two S2 peptides bound to the TtSlyD are shown in pink. 
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completely conserved (fig. S1), although other 
species have a proline at residue 63 which could 
reach the active site with additional tilting of 
the FKBP domain. A proline near this posi- 
tion may help to localize the chaperone to the 
complex and facilitate assembly. 

The lack of contacts between SlyD and MraY 
suggest that the soluble domain of protein E 
alone could form a complex with SlyD. We 
coexpressed EcSlyD with N-terminal trunca- 
tions of protein E from either ID21 (residues 
33 to 76) or the shortest isoform a3 (residues 
33 to 75). Both form complexes that could be 
purified by an affinity tag on the soluble do- 
main (fig. S2E). These observations point to 
a high-affinity interaction between SlyD and 
the C-terminal domain of protein E. 

Protein E is unstable in the absence of SlyD 
(8) and is rapidly degraded (23). It has been 
speculated that prolyl-isomerization is central 
to the lysis mechanism (22), but thisideaisnot | 
supported by available evidence. Nonproline 
mutations in protein E can rescue lysis in a 
AslyD background (18, 45), as well as completely 
replace the cytoplasmic domain of protein E 
with an unrelated globular protein (20, 27, 45). 
We also performed our lysis assay with sev- 
eral SlyD variants in the AslyD strain (fig. S2C). 
As before, protein Egx174 was unable to pro- 
mote lysis in the absence of SlyD, however 
lysis could be rescued by expression of either 
EcSlyD or EcSlyD,54. Thermus thermophilus (Tt) 
SlyD has high structural homology to EcSlyD 
(Fig. 5D and fig. S2, B, F, and G) and also res- 
cued lysis in the AslyD strain (fig. S2C). We 
generated an EcSlyD Y68K mutant that sub- 
stantially reduces prolyl-isomerase activity 
(46) and this too could rescue lysis. Finally, 
we purified a complex of protein E using the 
Epos rescue mutants (R3H and LI19F) (45) in 
the AslyD strain. This YE complex was very 
unstable and aggregated according to SEC (fig. 
S2H). These results support the conclusion that 
the primary role of SlyD in promoting lysis is not 
prolyl-isomerization, but to protect protein E 
and stabilize the YES complex. This does not 
obviate a role for proline binding in complex 
assembly. Mutation of P65, in the FKBP active 
site, creates a slower-lysis phenotype (22), which 
may indicate that binding of these residues by 
SlyD is key to assembling the YES complex. 


Protein E is evolutionarily constrained 


Protein E arrived late in the evolution of ®X174 
and was overprinted into a +1 reading frame 
in the ORF for gene D (47, 48). The structure 
supports that this embedding constrains the 
evolution of protein E (72). Considering the 
sequence changes across protein E isoforms, 
we note that from the N terminus through 
residue 70, with few exceptions, each position 
that is not completely conserved is either si- 
lent or that there is only a slight change in 
protein D (fig. S8A). The silent changes vary 
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the codon’s second position, which is the wob- 
ble position in the overlapping codon for pro- 
tein D. For example, residue W7 in ®X174 is 
replaced with a Ser or Leu in other species. 
Although seemingly substantial changes, each 
is coded by the sequence UXG and all three 
variations at this position are sampled. These 
variable positions generally do not contact 
MraY. An exception is found in the G4 isoform 
where a phenyalanine occurs at position 19, as 
in the Epos mutant, which results in a change 
from alanine to serine at position 79 in protein 
D. This places a polar residue in the hydro- 
phobic core of protein D (fig. S8). This single 
nucleotide change increases protein E affinity 
to MraY but likely lowers the stability of pro- 
tein D and the overall fitness of the virus. In- 
troduction of this mutation results in smaller 
plaque formation relative to the wild type (23). 
The C terminus of protein D and the exten- 
sions in the longer isoforms of protein E are 
likely disordered and less constrained, hence 
the increased sequence variability (fig. S8). 


Conclusions 


The mechanism for ®X174 inhibition of pep- 
tidoglycan biosynthesis is suprisingly simple. 
Protein E binds in the active site cleft prevent- 
ing access of the lipid substrate and blocking 
conformational changes needed for catalysis 
(fig. S14). The YES complex structure provides 
a template for interpreting all the previous 
results for lysis by ®X174. Notable among these 
are that the extensive functional mutations 
in protein E and MraY map to the interface 
between the two proteins. SlyD binds to the 
cytoplasmic domain of protein E to stabilize 
the MraY/protein E complex. As we demon- 
strated, the requirement for SlyD is based on 
the protein binding properties, as distantly re- 
lated SlyD homologs can rescue lysis in a AslyD 
strain. SlyD in this case does not serve directly 
in the mechanism of inhibition of MraY and 
can be functionally replaced. 

Our structures offer some insights for future 
work applying SGL proteins as therapies. Pro- 
tein E binding to EcMraY identifies a distinct 
mechanism from current inhibitors. Blocking 
of the lipid substrate access to the active site in 
MraY could be exploited by small molecules, 
with structure-based drug design guided by 
the protein E structure. Expanding protein E 
efficacy to Gram-positive bacteria and, potentially, 
other related polyisoprenyl-phosphotransferases 
such as bacterial WecA or the mammalian 
DPAGTI could further increase applicability. 
SGL proteins, which have distinctly evolved 
many times (49), can also be used to improve 
the antibacterial potency of bacteriophages. The 
use of phages for medical therapies, although 
known for a hundred years, has attracted re- 
cent attention (J). Improving the efficiency of 
®X174 protein E based on structural infor- 
mation may help these therapies achieve their 
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greatest potential in the fight against antimi- 
crobial-resistant pathogens. 


Materials and Methods 
Coexpression of EcMraY, protein E, and EcSlyD 


AslyD BL21(DE3)-competent cells were co- 
transformed with pET22b-SlyD,;,4 and either 
pRSFDuet£cMraY-Eqp2; or pRSFDuet-HcMraY- 
Eoxr74 and plated in LB-agar containing 35 ug/ml 
Kanamycin and 100 ug/mL Ampicillin. Our 
pET22b-SlyD,;4 construct expresses E. coli SlyD, 
modified by the removal of the flexible C- 
terminus. The pRSFDuet-EcMraY-Ejpo, plas- 
mid contains the ID21 isoform of protein E, 
along with a wild-type EcMraY to prevent cell 
lysis from the overexpression of protein E. Cells 
were grown in 2xYT media at 37°C, 225 rp.m., 
and induced at an ODgoo of 0.9 with 0.4mM 
IPTG at 18°C overnight. The culture was har- 
vested by centrifugation for 10 minutes at 
9000zg, 4°C then frozen or used immediately 
for purification. 


Purification of the YES complex 


The cells were resuspended in lysis buffer (20mM 
Tris-HCl pH 7.5, 300 mM NaCl, 10% Glycerol, 
5mM £-mercaptoethanol (BME), 0.1ImM PMSF, 
0.1mM Benzamidine) and homogenized using a 
M-110L microfluidizer (Microfluidics). The lysate 
was cleared by a 20-minute centrifugation at a 
speed of 22,000xg. The supernatant was then 
centrifugated at 167,424xg and the resulting 
membrane pellet was then solubilized in the 
extraction buffer [10 mM HEPES pH 7.5, 300 mM 
NaCl, 5% Glycerol, 5mM BME, 0.1mM phenyl- 
methylsulfonyl fluoride (PMSF), 0.1mM ben- 
zamidine, 10 mM imidazole and 1% dodecyl 4-O- 
a-D-glucopyranosyl-f-D-glucopyranoside (DDM)] 
After allowing for extraction for 1.5 hours at 
4°C, the solution was centrifuged at 167,424 
for 30 minutes and the remaining lysate was 
mixed with ImL NiNTA resin (Qiagen, Alameda, 
CA) then nutated at 4°C for two hours. This 
solution was loaded onto a gravity column and 
then washed with five column volumes of wash 
buffer (10 mM HEPES pH 7.5, 150 mM NaCl, 
5% glycerol, 5mM BME, & 0.03% DDM) with 
10mM imidazole followed by five column vol- 
umes of wash buffer with 30 mM imidazole. 
The YES complex was eluted in 20mL of wash 
buffer containing 200 mM imidazole. The final 
purification step was SEC (Superdex 200 10/300 
GL, Millipore Sigma) in 10mM HEPES pH 7.5, 
75 mM NaCl, 5% Glycerol, 5mM BME and 0.03% 
DDM. Fractions were assessed by SDS-PAGE and 
directly used for cryo-EM sample preparation. 


Co-expression of EcMraY and protein E in 
various SlyD backgrounds 


The pRSFDuet-E-EcMraY and pRSFDuet-Epos- 
EcMraY expression vectors were transformed 
into BL21-Star cells (Novagen). Similarly, the 
pRSFDuet-E(C-term)-SlyD,54 was transformed 
into SlyD-knockout cells. The cultures were 


grown at 37 °C to an ODgoo 0.8 and induced 
with 1 mM IPTG. Induced cultures were grown 
for 3 hours followed by harvesting by centrifu- 
gation at 9,000xg for 20 min. Cell pellets were 
resuspended in lysis buffer and lysed by son- 
ication. The lysate was then cleared by cen- 
trifugation at 22,000xg, followed by a second 
centrifugation at 234,78xg for 1 hour to isolate 
the membrane fraction. The complex was ex- 
tracted in 20 mM Tris-HCl pH 7.5, 300 mM NaCl, 
10% Glycerol, 10 mM Imidazole, and 1% n-Decyl- 
B-Maltoside (DM) and incubated at 4°C for 
1.5 hours. The debris was cleared by centrifu- 
gation at 234,788ag for 30 min. The sample was 
incubated with 1 mL NiNTA resin for 1 hour, 
followed by a wash with 50 column volumes 
lysis buffer with 30mM Imidazole. The protein 
E complexes were similarly eluted in 300mM 
Imidazole. The elutions were concentrated and 
further purified by size exclusion chromatogra- 
phy (Superdex 200 10/300 GL, Millipore Sigma). 


Lysis assays of WT protein E ©X174 and ID21 


LEMO DE3 competent cells were transformed 
with a pRSF-Duet vector either empty, with 
protein Egxy74, or protein Eypo;. Cultures were 
grown to an ODg¢oo of 0.2 and inoculated into a 
Corning 96-well Clear Flat Bottom plates in 
100uL triplicate aliquots and induced as de- 
scribed previously. Cultures were incubated at 
37C with orbital shaking at 220rpm using an 
Infinite M Nano+ (Tecan, Switzerland). Readings 
were taken in 5-minute intervals for 90 minutes. 


Lysis assay for protein E constructs 


LEMO DE3 competent cells (New England 
Biolabs, MA, USA) were transformed with a 
pRSFDuet vector either empty, with C-terminally 
FLAG tagged protein Eox174 variants (WT, 
P21A, K46A). The lysis assays were performed 
in three biological replicates as previously de- 
scribed (27). Absorbance readings were recorded 
in 5-minute intervals for 1 hour and 30 minutes. 
Manual readings were taken using a Biowave 
Cell Density Meter CO8000. The values were 
plotted using GraphPad Prism version 9.1.1 
for macOS. 


Lysis assays based on SlyD variants 


AslyD (18) cells were transformed with either 
a control empty pRSF-Duet vector or pRSFDuet- 
Proteingxy74 and either pET22b-EcSlyD, pET22b- 
SlyD,54, DET22b-EcSlyD Y68K, or pET22b-Thermus 
thermophilus SlyD. Cultures were grown in 2xYT 
media at 37°C and induced with 0.4mM IPTG 
once at an ODgoo of 0.2. Absorbance measure- 
ments were manually recorded in 5-minute 
intervals for 70 minutes. Similarly, AslyD cells 
were transformed with either a control empty 
pRSF-Duet vector or pRSF-Duet-Proteinjp21 
either alone, with pET22bEcSlyD, or with pET22b- 
EcSlyD,54 and induced with 0.4mM IPTG. Read- 
ings were recorded using an Infinite M Nano+ 
plate reader as described above. 
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Sample preparation for cryoEM 

The YES complex was diluted to 5.0 mg/mL in 
10 mM HEPES pH 7.5, 75 mM NaCl, 5% Glyc- 
erol, 5mM BME and 0.03% DDM. Additionally, 
the YES;p2; sample was supplemented with 
1mM E. coli total lipid extract (Avanti Polar 
Lipids, 100600P). Quantifoil holey carbon films 
R1.2/1.3 300 Mesh, Copper (Quantifoil, Micro 
Tools GmbH) grids were glow discharged with 
a 2-minute 20A plasma current using a Pelco 
easiGlow, Emitech K100X. Grids were prepared 
using a Vitrobot (FEI Vitrobot Mark v4 x2, Mark 
v3) by applying 3uL of sample onto the grid 
followed by a 3.5 second blot using a +8-blot 
force and plunge frozen into liquid ethane. 


Data acquisition and analysis 


The grids were imaged in a 300 kV cryo-TEM 
microscope equipped with a Gatan K3 6k x 4k 
direct electron detector and a Gatan Energy Filter 
(slit width 20eV) in super-resolution mode using 
Serial EM. Datasets were collected at a 105k 
magnification with a pixel size of 0.416 A/pixel. 
Movies with 40 frames were recorded with a 
total exposure dose of 60 e/A? and a defocus 
range of -1.0 to -2.5 um. For the YES;p5; complex, 
a total of 12,070 movies were recorded. Movies 
were normalized by gain reference and mo- 
tion corrected using the patch motion correc- 
tion built in function in cryosparc (v3.3.2) with 
a twofold bin that resulted in a pixel size of 
0.832 A/pixel (50). The contrast transfer func- 
tion (CTF) was estimated using CTFFIND4 (57). 
Micrographs were manually curated, and low- 
quality images were removed for further analy- 
ses. A total of 2,462,335 particles were obtained 
followed by the generation of 6 ab-initio mod- 
els. Out of the 6 models, two models are se- 
lected for classification into “good” and “trash” 
volumes. All particles were then sorted in these 
two volumes through heterogeneous refinement 
using particles extracted with a 4x bin, which 
produced 6,589,696 good particles. Heteroge- 
neous refinement was used in an iterative man- 
ner to sort the particles into the 5 volumes (4 
good and 1 trash). The 1,151,777 good particles 
were used for non-uniform homogeneous re- 
finement to generate a higher resolution vol- 
ume. The particles were then extracted with 
a 3x bin and sorted into 4 iterations of the 
higher resolution volume and 1 trash volume. 
Iterative rounds of heterogeneous refinement 
at 3x bin produced 935,754 particles. Particles 
were then extracted in a 2x bin and heteroge- 
neously refined into either high- or low-resolution 
volumes. At this point, discerning features in 
the soluble region of the model were used to 
select the most complete volumes. The volumes 
were individually refined through non-uniform 
refinement and the particles that composed the 
volumes with most complete and highest reso- 
lution were used. A total of 122,452 particles 
were used for the most complete model obtained 
upon non-uniform refinement. The FSC-masked 
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resolution was 3.5A, while the unmasked reso- 
lution was 3.9A. The half-maps were then used 
for post-processing through DeepEMhancer 
(52) with the high-resolution model selected for 
our most complete density map. Post-processing 
through DeepEMhancer removed the micelle 
and improved the features on the soluble por- 
tions of the map, however the lipid densities 
were also removed. The lipid densities described 
in this work are those of the YES;p2; map before 
post-processing. Notably, the dimer-interface 
lipid density was also present in the YES@xq74 
density map without the supplementation of 
E. coli lipid extract. Figure 1D uses the densities 
before post-processing for the MraY dimer, mi- 
celle and lipid densities, and the DeepEMhancer 
post-processed map for protein E and SlyD. 
Supplemental figures S9 and S12 were made 
with the map before DeepEMhancer sharpen- 
ing. The YESgx774 complex dataset was pro- 
cessed in this same manner. A total of 10,798 
movies were recorded. The model for the YESjpo; 
complex was then used as a template for tem- 
plate picking, from which 1,516,368 particles 
were picked and curated. Following gradual 
un-binning and sorting into good and trash 
volumes, 155,270 particles were used for the 
final iteration of non-uniform refinement. The 
local resolution of both maps was performed 
on cryosparc (v3.3.2). The half-maps were then 
post-processed using DeepEMhancer as de- 
scribed previously. 


Model building 


For starting models we used the Aquifex aeolicus 
un-bound structure (PDBID:4J72) (25) for EcMraY 
and the E. coli NMR structure (PDBID:2K8I) (37) 
for EcSlyD which were fit using phenix.dock. 
SlyD was then split into its two domains, IF 
and FKBP, at residues Y68 and G127. Protein E 
was modeled de novo up to residue P65 using 
Coot 0.8.9.2. The structures of the EcMraY, pro- 
tein E, and EcSlyD-IF domain were refined using 
phenix.real space refinement and ISOLDE 1.6, 
and validated with PHENIX-1.19.2. After the re- 
finements of EcMraY, protein E, and the EcSlyD-IF 
domain were completed, the FKBP domains 
were docked into density using ChimeraX. The 
complete YES complex structure was then re- 
fined using PHENIX-1.19.2 and ISOLDE 1.6. 
RMSDs were calculated using ChimeraX Match- 
maker chain alignment. Structure figures were 
made using ChimeraX and sequence alignments 
using Jalview and ClustalW (53-55). 
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Quantum metric nonlinear Hall effect in a topological 
antiferromagnetic heterostructure 
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Quantum geometry in condensed-matter physics has two components: the real part quantum metric 
and the imaginary part Berry curvature. Whereas the effects of Berry curvature have been observed 
through phenomena such as the quantum Hall effect in two-dimensional electron gases and the 
anomalous Hall effect (AHE) in ferromagnets, the quantum metric has rarely been explored. Here, we 
report a nonlinear Hall effect induced by the quantum metric dipole by interfacing even-layered 
MnBizTe, with black phosphorus. The quantum metric nonlinear Hall effect switches direction upon 
reversing the antiferromagnetic (AFM) spins and exhibits distinct scaling that is independent of the 
scattering time. Our results open the door to discovering quantum metric responses predicted 
theoretically and pave the way for applications that bridge nonlinear electronics with AFM spintronics. 


onlinearities are crucial in many branches 
of physics, ranging from atomic physics 
to condensed-matter and complex dy- 
namical systems. Nonlinear electrical 
transport is the foundation of applica- 
tions such as rectification and wave mixing. 
Classically, the most well-known nonlinear 
device is a PN diode (Fig. 1A). Noncentrosym- 
metric polar materials (Fig. 1B) are similar to 
PN diodes as they both possess an electric 
dipole. They have recently been discovered to 
show intrinsic nonlinear electrical transport, 
which may not only lead to new nonlinear 
applications but also provide a powerful probe 
of the quantum geometry of the conduction 
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electrons (J-16). Broadly, the nonlinear trans- 
port in both diodes (Fig. 1A) and noncentro- 
symmetric conductors (Fig. 1B) arises from an 
inversion asymmetric charge distributions 
(e.g., an electric dipole). Because the electron 
has another fundamental degree of freedom, 
spin, an interesting question is whether spin 
can also lead to an electrical nonlinearity even 
in a centrosymmetric lattice. One ideal platform 
is the class of parity-inversion time-reversal 
(PT)-symmetric antiferromagnets (AFMs) (77), 
where only the spins feature a noncentrosym- 
metric distribution (Fig. 1C). 

Important clues can be drawn from previous 
optical experiments, in which optical second- 
harmonic generation (SHG) has been observed 
in the PT-symmetric AFMs, including Cr2O3 
and CrI, (78). Nevertheless, nonlinear transport 
is distinct because it directly probes the Fermi 
surface electrons and in many cases their geo- 
metrical properties (/, 2). As such, it enables a 
probe of the quantum geometry (J, 2) of the 
topological bands at the Fermi level. 

The quantum geometry has two parts, T = 
g-i/2Q (1) (Z is the quantum metric tensor). The 
imaginary part is the well-known Berry curva- 
ture Qo, = —2Im} > [(Un| EOk,, Urm)( Uhm | Eke, Un] 5 

men 
which describes the curvature of wave func- 
tion in Hilbert space (n, m are band indices 
and a, B are spatial directions). Berry curva- 
ture has been identified as the source of many 
unusual electronic and optical responses. By 
contrast, the real part is the quantum metric, 
up = Re)~ [(Un|tOn,, Um)( Um | tx, Un)], Which 
men 
measures the distance between neighboring 
Bloch wave functions in Hilbert space [i-e., 


q 


see section IV.1 of (Z9)]. Although equally int ice 
tant, the quantum metric is much less exple--—-— 
There have been a few examples, including the 
orbital magnetic susceptibilities (20), a third- 
order Hall effect (73), and the quantum metric 
in atomic physics (27). However, the way quan- 
tum metric regulates electronic motion remains 
largely unknown. Recently, theories have pre- 
dicted a wide range of exotic quantum metric 
responses (20, 22-34). 

Here, we report the observation of the quan- 
tum metric dipole-induced second-order anom- 
alous Hall effect (AHE) (20, 22-25). In past 
decades, there have been numerous studies of 
the AHE (both linear and nonlinear) induced 
by Berry curvature. Recent theoretical studies, 
however, predict that the quantum metric can 
also lead to AHE, thereby advancing our under- 
standing of the fundamental origin of the AHE. 
Distinct from the Berry curvature-induced 
AHEs, this effect is predicted to exist in the 
PT-symmetric AFMs (20, 22-25), where PT 
forces the Berry curvature to vanish identically 
but the effects of quantum metric can manifest 
strongly. We design and fabricate a feasible 
material platform and demonstrate the ap- 
pearance of the quantum metric nonlinear 
Hall effect (NHE). To conceptualize this type ‘ 
of NHE, we draw comparison with the well- 
known AHE in ferromagnetic metals (35), where 
Berry curvature leads to the anomalous veloc- 


ity and therefore the AHE, Vanomalous cc Ky x 
k 


Q (E is the in-plane source-drain electric field). 
By contrast, in a PT-symmetric AFM, Berry 
curvature is zero as dictated by PT. However, 
a nonzero quantum metric g can induce an 
anomalous velocity to second order in E), 


Vanomalous ~[E, x [Vi x (gE)) | , as proposed 


in (20). This leads to an intrinsic second-order 
Hall effect. From the expression above, one 
can show that this effect is nonzero only when 
the system breaks both P and T. Therefore, we 
need PT-symmetric AFM conductors with a 
large quantum metric on the Fermi surface. We . 
have carefully considered possible materials, 
and identified two-dimensional (2D) even- 
layered MnBioTe, (15, 36-46) as an ideal plat- 
form. Even-layered MnBi.Te, is a PT-symmetric 
AFM. Moreover, its topological bands support 
gate-tunable transport and a giant quantum 
metric (19). However, its lattice has C3, rotational 
symmetry (Fig. 1, D and E), which forces the 
effect to vanish (22). To break C3,, we interface 
MnBi,Te, with black phosphorus (BP) (47). 


Demonstration of rotational 

symmetry breaking 

We start by showing that interfacing MnBisTex. 
with BP indeed breaks its C;., rotational symme- 
try. To this end, we study the directional depen- 
dence of the resistance (6) of MnBi,Te, without 
and with BP. We fabricated a six-septuple-layer 
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(6SL) MnBi,Te, device with radially distributed 
electrical contacts (Device-BM1). The four-probe 
resistance (T = 1.8 K) is found to be isotropic 
(Fig. 1G, blue curve), consistent with the C3, 
symmetry. We then stacked a BP layer (~10 nm) 
onto this MnBi,Te, sample and performed the 
measurements again. The resistance develops 
a clear anisotropy with a 180° periodicity (Fig. 
1G, red curve), providing a clear signature of the 
breaking of C3, [section I.4 of (29) shows that 
the transport is dominated by the MnBi,Te, 
layer of the heterostructure]. The transverse 
resistance and two-probe resistance also show 
the breaking of C3, [fig. S12 of (19)]. We further 
substantiate the breaking of C3,, symmetry by 
an independent method, the optical SHG, at 
room temperature. As shown in Fig. 1H, our 
SHG data also show the clear breaking of C3, 
symmetry [see detailed discussions in section 
1.5 of (19)]. Our demonstration of C3, breaking 
establishes the BP/MnBi,Te, heterostructure 
as an ideal platform to search for this effect. 


Observation of the nonlinear Hall effect 


To measure the linear and nonlinear electrical 
transport, we pass a current at frequency w ([”) 
and use the lock-in technique to detect linear 
voltage V® and nonlinear voltage V2°. We 
describe the nonlinear voltage as Ves where 
zis the direction of the nonlinear voltage V7 
and j,k are the directions of the injected 
current J®. All measurements are performed 
at B= 0. 

Figure 11 shows the nonlinear Hall voltage 
Ve, of the Device-BM1 before and after being 
interfaced with BP. A prominent nonlinear 
Hall signal emerges only after BP is intro- 
duced. This is in sharp contrast to the linear 
voltage (Fig. 1, inset), which becomes even 
slightly smaller upon the introduction of BP. 
Such observation agrees well with the the- 
oretical expectation of the intrinsic NHE in- 
duced by a quantum metric dipole. To exclude 
that the effect is caused by a Berry curvature 
dipole (4, 6, 7, 9), we study the relationship 
between the second-order NHE and the AFM 
order in MnBi,Te,. 


The AFM spin-induced nonlinearity 


Overall, we have fabricated 30 BP/even-layered 
MnBi,Te, heterostructure devices [see section 
1.0 of (79) for our systematic data that con- 
firm the MnBi,Te, thickness in our devices]. 
In all of the 30 devices, we have observed the 
NHE with consistent behaviors as a function 
of AFM order, spatial direction, scattering time, 
vertical electric field, and doping [see fig. S22 
and table S1 for a summary of all 30 devices 
(19)]. Here, we focus on the Device-BMB1 (Fig. 
2A), which has two-layer BP on both sides of 
six-septuple-layer MnBi,Te,. Moreover, we have 
ensured that the crystalline a axes of the BPs 
and the MnBi,Te, are aligned (Fig. 2A). Such a 
carefully controlled configuration is important to 
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Fig. 1. Spin-induced electrical nonlinearity in PT-symmetric antiferromagnets and introduction to 
our sample. (A and B) Nonlinear electrical transport in PN junctions and noncentrosymmetric conductors 
(charge-induced electrical nonlinearity). (©) Nonlinear electrical transport in PT-symmetric AFMs (spin- 
induced electrical nonlinearity). (D to F) Lattice structures of the MnBiTe, and BP. (G and H) Angle-resolved 
resistance and optical SHG measurements of a 6SL MnBizTe, before and after being interfaced with BP. 
(I) The nonlinear Hall signal V2 before and after being interfaced with BP at B = 0 T. Inset: The linear 
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longitudinal voltage V® before and after being interfaced with BP. All data in Figs. 1 to 4 were taken 
at temperature T = 1.8 K unless otherwise noted. Data in Fig. 1 and Fig. 2B are from Device-BM1. Data in 
Fig. 2, E to J, Fig. 3, and Fig. 4A are from Device-BMBI1. Data in Fig. 4F are from Device-BM21. 


preserve MnBioTe,’s PT symmetry, which en- 
sures that the Berry curvature and Berry cur- 
vature dipole vanish. Figure 2B shows a large 
transverse nonlinear response V,’,. We have 
also measured the longitudinal nonlinear re- 
sponse V2, which shows no observable signal. 
Therefore, our data reveal an interesting “Hall 
dominance” in the nonlinear transport. 

We now focus on exploring how the non- 
linear Hall signal depends on opposite AFM 
states. In ferromagnets, the opposite ferromag- 
netic states can be controlled by sweeping the B 
field. In PT-symmetric AFMs including Cr.Os, 
even-layered CrI3, and even-layered MnBizTe, 
(44, 48, 49), previous works have shown that 
the opposite AFM states can be controlled by 
sweeping the vertical B, field under a fixed 
vertical E, field. Hence, we follow the pre- 
viously established procedures (44): Under a 
fixed E, (E, = -0.17 V/nm), we sweep B, from 
-8 Tto 0 Tor from +8 T to 0 T to prepare the 
two AFM states (Fig. 2, C and D). We first 


study the AFM I. The linear voltage V®, (Fig. 
2E) exhibits a typical ohmic behavior. The 
nonlinear voltage Vie, (Fig. 2G) is prominent 
and its sign is positive. We then prepare AFM 
II. The linear voltage V°, (Fig. 2F) remains 
unchanged. In sharp contrast, the nonlinear 
voltage Vie. (Fig. 2H) flips sign. For both 
AFM I and II, the NHE is only present in the 
AFM phase (Fig. 2, I and J). Our observation 
that the nonlinear Hall signal flips sign upon 
reversing the AFM order further demonstrates 
its quantum metric dipole origin, because 
the quantum metric dipole is theoretically ex- 
pected to be opposite for the opposite AFM 
domains [see section III of (19)]. Our non- 
linear Hall signal measures an average over 
all AFM domains. However, our experiments 
suggest that our sample is prepared into pre- 
dominantly one domain. If our sample con- 
sisted of opposite domains with a 50%-50% 
composition, then the measured nonlinear 
Hall signal would average to zero. By contrast, 
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Fig. 2. Observation of the antiferromagnetic NHE. (A) Schematic illustration of 2L BP/6SL MnBizTe4/2L 
BP device (Device-BMB1). The crystalline a axes of the BPs and the MnBizTes were all aligned [fig. S16 


of (19)]. (B) The longitudinal (V2 yr 


) and Hall (V2) components of the nonlinear voltage. (C and D) We follow 


the procedures established by previous works (44): Under a fixed E, (-0.17 V/nm), we sweep B, from -8 T 

to OT or from +8 T to 0 T to prepare the two AFM states. (E and F) Linear longitudinal voltage as a function of 
incidence current for AFM | and AFM Il. (G and 1) Nonlinear Hall voltage as a function of incident current 
and temperature of AFM I, respectively. (H and J) The same as panels (G) and (I) but for AFM Il. 


our data show a large nonzero nonlinear Hall 
signal. Moreover, the sign of the observed sig- 
nal flips as we prepare the opposite AFM do- 
main. Further, the magnitude of the measured 
signal is consistent with the theoretically cal- 
culated value, which assumes a single domain. 
Spatially resolved magnetic measurements 
will be needed to determine the exact domain 
composition. 

We now perform further systematic studies. 
Because the nonlinear Hall current flips sign 
upon reversing the AFM order, all the non- 
linear Hall data (apart from Fig. 2) are ob- 
tained by taking the difference between the 
two AFM domains. First, the intrinsic NHE is 
expected to be independent of the scattering 
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time, unlike other related responses. Similar 
to the intrinsic AHE in ferromagnetic metals 
(35), and in contrast to the quantum AHE, there 
is still dissipation through the linear Drude 
conductivity o,,, The nonlinear Hall conduc- 
tivity can be directly extracted from our data 
bya = 0 ee = (VAP eR wd) 
where /, w, d are the length, width, and thick- 
ness of the sample. Previous experiments have 
studied the scattering time t dependence of 
various Hall effects (6, 9, 14, 35) by investigat- 
ing the scaling between the corresponding 
Hall conductivity and the Drude conductivity. 
Following this established method, we study 
the scaling between of. ando,,. Our data (Fig. 
3A) show that 07°, is independent of oz, below 


~15 K. Moreover, consistent results have been 
observed at multiple charge densities in the 
same sample and from different samples [sec- 
tion III.9 of (19)]. These systematic data point 
to the conclusion that the 07°, is independent 
of scattering time t below ~15 K. Above ~15 K, 
07°, vanishes quickly across Néel temperature 
Ty because the AFM order vanishes and our 
NHE only exists in the AFM phase. Hence, 
studying the t dependence at temperatures 
near 7y would require one to take the strong 
influence of the AFM order near Ty into ac- 
count [see section III.8 of (19) for additional 
measurements and analysis]. Second, the in- 
trinsic NHE does not require a noncentrosym- 
metric lattice or any explicit breaking of PT 
symmetry. To test this, we explicitly break PT 
by applying a vertical E, field via dual gating. 
The nonlinear Hall signal is already promi- 
nent even at E, = 0 (Fig. 3D), confirming that it 
does not require any PT breaking. Moreover, 
the nonlinear Hall signal is symmetric for +£., 
also consistent with the expectation [see 
section IV.2 of (79)]. Third, the NHE is ex- 
pected to be sensitive to the direction of the 
incident current J®. In Fig. 3B, we measure the 
nonlinear Hall conductivity as a function of 
the direction of 7°. Indeed, the signal is most 
prominent when /® is along a particular in- 
plane direction. In this way, we experimentally 
mapped out the direction of the relevant geo- 
metrical dipole (the quantum metric dipole in 
our case, as we demonstrate next). Moreover, 
the intrinsic NHE is found to be independent 
of frequency [fig. S23 of (19)]. In principle, the 
frequency independence is expected to per- 
sist until w is large enough to induce an 
interband transition (roughly terahertz or far- 
infrared). Future experiments are needed to 
test the NHE in that regime. 


Excluding competing mechanisms 


Although we tried to eliminate Berry curva- 
ture dipole by aligning the crystalline a axes 
between BPs and MnBi,Te, to preserve PT 
symmetry (Fig. 2A), let us assume that the 
alignment is imperfect, so Berry curvature di- 
pole is allowed. We now show that the observed 
relationship between the nonlinear Hall sig- 
nal and AFM order can distinguish between 
Berry curvature dipole Dgery and quantum 
metric dipole Dyetric- Dperry Can be understood 
as a distribution of the Berry curvature around 
the Fermi surface such that it is larger on one 
side of the Fermi surface than on the opposite 
side. A similar picture holds for Dyetric (Fig. 3). 
As we observe that the nonlinear Hall signal 
changes sign upon the reversal of AFM order, 
the dipole that causes our observed nonlinear 
Hall signal must also flip. Let us assume that 
the AFM I has Dgerry > 0 and Dwmetric > 0, 
which is visualized in a tilted gapped Dirac 
band structure in Fig. 3, E and G. We now flip 
the AFM order to the AFM II by performing 
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time reversal T. Under T, the bands are flipped 
between +k (Fig. 3, F and H), the Berry cur- 
vature flips sign, but the quantum metric keeps 
the same sign. Hence, from Fig. 3, F and H, one 
can see that, Dgeny(AFMII) = Dgeny(AFMI) , 
but Detric (AFMII) = —Dwetric (AFMI). There- 
fore, our observation that the nonlinear Hall 
signal flips sign upon reversing the AFM order 
excludes the Berry curvature dipole mecha- 
nism. In section II.1 of (19), we enumerate ex- 
perimental results, including the relation with 
AFM order, scaling, vertical electric field de- 
pendence, and relation with mirror symmetry, 
which corroborate that the Berry curvature di- 
pole mechanism cannot account for our data. 
Within the nonlinear effects that flip sign 
upon reversing the AFM order, there is an- 
other possibility, the second-order Drude ef- 
fect (5, 12, 17, 22). This effect is expected to be 
proportional to 7? (22) and therefore can be 
ruled out on the basis of our scaling data in 
Fig. 3A. Moreover, the NHE is antisymmetric 
(upon exchanging the first two indices), aby = 
—oNHE | but the second-order Drude effect 


OL" J 
(SODE) is symmetric, of)" = of¢) (22). Using 
an electrical sum-frequency generation meth- 
od [section II.2 of (19)], we showed that our 
signal is indeed antisymmetric, 07%, = —07?,, 
which demonstrates that the SODE is insig- 
nificant in our signal. In section II.2.3 of (79), 
we present additional data that suggest that 
the NHE is dominant over the SODE at dif- 
ferent temperatures and charge densities. In 
section III.5 of (19), we show that the nonlin- 
ear Hall signal is negligibly small at +8 T be- 
cause the forced ferromagnetic state recovers 
inversion symmetry. We also carefully addressed 
other competing origins such as thermal and 


accidental diode junctions [section II.3 of (19)]. 


Energy-resolved probe of quantum metric 
in PT-symmetric antiferromagnets 


We also study the evolution of the nonlinear 
conductivity on with the charge density. The 
nonlinear Hall signal is zero inside the charge 
neutrality gap (Fig. 4A). This is consistent with 
the expectation that the NHE is a Fermi sur- 
face property. As we tune the Fermi energy 
away from the charge neutrality, the nonlinear 
Hall signal emerges. Notably, the conductiv- 
ities in electron and hole regimes have the same 
sign. As we go deeper into the electron-doped 
regime, the signal reverses sign. 

We now provide an intuitive physical picture 
to understand the large quantum metric dipole 
and its Fermi-level dependence. MnBi,Te,, fea- 
tures Dirac surface states, which are gapped 
owing to the AFM, leading to a large quantum 
metric near the gap edge. Moreover, because 
the AFM order breaks both 7 and P, the Dirac 
bands are asymmetric about k = 0 (Fig. 3G). 
Hence, at a fixed energy, positive and negative 
momenta have a different quantum metric, 
leading to a nonzero quantum metric dipole. 
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Fig. 3. Systematic investigation of the NHE. (A) The scaling between the nonlinear Hall conductivity 
and the Drude conductivity oy = 1/Ryy. The nonlinear Hall conductivity can be directly extracted from the 
data as of = Je /E°? = (VE) /(I2?R8,w"d). (B) Angular dependence of the nonlinear Hall conductivity in 
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Device-BM1. (C) Dual-gated resistance map of the 2L BP/6SL MnBizTe,/2L BP heterostructure (Device- 
BMB1). The vertical electric field E, and charge density dependence can be independently tuned by combining the 
top and bottom gate voltages. (D) F, dependence of the nonlinear Hall conductivity and linear longitudinal 
resistance. E, follows the dashed line in (C). (E to H) Schematic illustration of the Berry curvature dipole 


(Dgenry) and the quantum metric dipole (Dyetric) for the AFM 


and AFM Il of the BP/6SL MnBizTe,/BP 


heterostructure (see text). Although we aligned the crystalline axes of BP and MnBizTe, in our Device-BMB1 
(Fig. 2A), realistically it is difficult to make the alignment perfect. If the alignment is imperfect and PT 
symmetry is broken, a Berry curvature dipole is allowed. 


Intuitively, we can understand the sign of the 
nonlinear Hall signal by considering whether 
positive or negative momenta have a larger 
quantum metric. We see from Fig. 3G that both 
upper and lower parts of the Dirac cone have 
g(+kr) > g(—kp), suggesting that the non- 
linear Hall signals should show the same sign in 
electron and hole regimes, consistent with our 
data (Fig. 4A). The sign change in the electron- 
doped regime is beyond this simple picture. 


To achieve a more comprehensive under- 
standing, we built an effective model of the 
BP/6SL MnBi,Te,/BP heterostructure [sec- 
tion IV.4-9 of (19)]. Owing to the incommen- 
surability of the BP and MnBi,Te, lattices, 
we need to derive the coupling between the 
Bloch states of the two materials in the real- 
space continuum [i.e., within the extended 
Brillouin zone (BZ)]. The low-energy bands 
are located in the BZ center, so only Bloch 
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Fig. 4. The quantum metric dipole as the microscopic geometrical origin. (A) Experimentally measured 


nonlinear Hall conductivity as a function of charge de 


of Ne 
BP/6SL MnBizTe4/BP heterostructure calculated with 


nsity. (B) Theoretically calculated ae as a function 


based on the BP/6SL MnBizTe,/BP band structure (see text). (C to E) The electronic structure of the 


an effective model (see text). (C) Fermi surface at 


-50 meV (the lower part of the surface Dirac cone). The coupling between MnBizTe, and BP is turned off, so 
that contour respects C3, symmetry. The quantum metric gy. and gyx plotted around the Fermi surface. 


(D) The nonlinear Hall conductivity is given by the int 


egral Duetric = I(Yweu - Va8yx) 8(€ — ef) around the 
k 


Fermi surface. With C3, symmetry (left panel), the integral goes to zero. After turning on the coupling 


between MnBizTe, and BP (right panel), C3, is broken, 
To more clearly see how the integra 


dl dl 9 


of the polar angle 0, Dutetric = J (VS — Vay) <= ae 
FS 


making the integral around the Fermi contour nonzero. 


changes to nonzero when C3, is broken, we rewrite Diyetric aS an integral 


(dl is an infinitesimal length along the Fermi surface). 


The inset presents the change of the above kernel [see section III.16 of (19) for details]. (E) Band structure of 


BP/6SL MnBizTe,/BP heterostructure as a function o 


f ky for ky = 0. Color represents the quantum metric 


of the bands. (F) Measured microwave rectification based on the intrinsic NHE. Inset: DC signal as a function 


of microwave frequency. (G) Schematic illustration of 


microwave rectification. (H) Schematic illustration of 


quantum metric-induced nonlinear responses. The horizontal axes are k, and ky. The two black arrows 


represent the Bloch wave functions at two nearby k point: 


s. The two arrows point to different directions, illustrating 


a finite distance between two wave functions (i.e., a finite quantum metric). This quantum metric leads to a NHE, 
which can turn an external AC electric field (e.g., the microwave in the figure) into a DC signal. 


bands with the same momentum hybridize. 
The coupling amplitude depends only on 
the characteristic decay length of the atomic 
orbitals as any discrete lattice structure is 
averaged out (47). The Hamiltonian reads 


hver Ur Us 
a at a a 
R{ke,ky)=| U,  hpp.t O- |. Amer and 
at rc 
U, O App. 


and top (bottom) BP. The spin-orbit coupling 
(SOC) in MnBi,Te, is crucial for a nonzero NHE 
because it allows the low-energy orbitals to 
feel the symmetry breaking by the AFM order 
[section III.15 of (79)]. In particular, the SOC 
was included in the model following the orig- 
inal work by (50). U, and U, denote the band 
hybridization caused by nearest-neighbor cou- 
pling between MnBi,Te, and BP. The bare 


hop. 1b) are Hamiltonians for 6SL MnBi,Te, 
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MnBi,Te, and BP parameters were obtained 


by fitting the first-principle band structures. 
The MnBi,Te, and BP coupling parameters 
were partly constrained by considering the 
independent data of Rex /Ry, so that an agree- 
ment in the overall magnitude was achieved 
independently. We adjusted the remaining free 
coupling model parameters to match detailed 
features in the charge density dependence [sec- 
tion IV.9 of (19)]. 

We first turn off the coupling between the 
MnBi,Te, and BP. The Fermi surface shown in 
Fig. 4C (-50 meV) is C3,, symmetric, and there is 
already a large quantum metric (227 and 2yz) 
around it. According to (22), the Detic Tespon- 
sible for the nonlinear Hall is given by Dyetie = 


I (epg — UrSy )5(€ — Eg) (v is the Fermi ve- 


locity). We plot the integral kernel in color in 
Fig. 4D. Positive and negative contributions 
around the contour exactly cancel each other 
out because of C3, symmetry. So, the integral | 
goes to zero (Fig. 4D, left panel). We then turn on 
the MnBi,Te,-BP couplings, which breaks C3... 
For the C3,-breaking contour, we observe un- 
equal contributions from the two colors, lead- 
ing to a nonzero Dyetric [Fig. 4D, right panel; 
see details in the caption and in section III.16 
of (79)]. Figure 4E shows the band structure 
of the BP/6SL MnBi,Te,/BP heterostructure, 
from which we can compute the intrinsic non- 
linear Hall conductivity core as a function of 
chemical potential. In particular, near the 
charge neutrality gap, we found that on: in- 
deed mainly comes from the quantum me- 
tric of the Dirac surface states, consistent 
with the intuitive picture above. The sign in- 
version in the electron-doped regime mainly 
comes from the quantum metric of the avoided 
crossing inside conduction bands according 
to our calculation [section IV.9 of (19)]. Ow- 
ing to the multiband nature of our model, the 
07°, was calculated by the general expression 


EmA=En 
oF 
Cran =a 28> Rel (4 
22 (el ry tin) (tal hate) 5( 2, — ep) (22), This gen- 
eral expression can be decomposed into the 
quantum metric dipole contribution plus 
additional interband contributions (AIC) 


(Un |TOky Um) (Um |tOky Un) 


En—Em 


On8y 
er d(en — ex) + AIC 
— €n 


ot, = 20 9 ete 
(1) 


where the first term is the quantum metric 
dipole contribution, and the second term is 


EmEn &5 


AIC= —2¢ 5 Re| (% 


n,m 


Un |Eey Un) (Um Fhe Un) 


En—Em 


OF (Un ey Um) (Um |tOky Un) 


a a= 8(En — Ep) Mis the 
band whose energy is closest to n). In our BP/ 
6SL MnBi,Te,/BP system, we found that the 
quantum metric dipole contribution strongly 


dominates, whereas the AIC is small [details 
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in section IV.3 of (19)]. Therefore, our non- 
linear Hall measurement is a powerful, energy- 
resolved probe of the quantum metric. 


Possible AFM spin-based wireless rectification 


The second-order nonlinear effect enables not 
only frequency doubling but also rectification. 
We use the intrinsic AFM nonlinear Hall effect 
to demonstrate wireless rectification with zero 
external bias (battery-free) and without mag- 
netic field. We inject microwave radiation and 
measure the DC signal. We observe clear rec- 
tification DC voltage in response to the micro- 
wave radiation (Fig. 4F), which shows a broad 
band response, including the Wi-Fi frequencies 
(2.4 and 5 GHz) and even higher frequencies 
[see fig. S35 and section V.2 of (9)]. In section 
III.12 of (19), we show that the rectification 
signal flips sign as we reverse the AFM state, 
which suggests the intrinsic quantum metric 
dipole origin. Apart from the intrinsic quan- 
tum metric dipole, extrinsic sources such as the 
Schottky diodes at the metal-MnBi,Te,, junction, 
unintentional diodes inside the MnBi,Te,, and 
the gap between the two gates can also lead to 
microwave rectification. To unambiguously rule 
out these extrinsic sources, future systematic 
experiments will be needed [discussion in 
section V.1 of (19)]. 


Discussion and outlook 


The intrinsic second-order Hall effect observed 
here realizes an electrical nonlinearity in- 
duced by the AFM spins and provides a rare 
example of a quantum metric response. As 
highlighted by recent theoretical studies, the 
influence of the quantum metric is expected 
to span many different areas, ranging from 
nonlinear responses in PT-symmetric AFMs 
to flat-band conductivity, superconductivity 
and charge orders in moiré systems, the frac- 
tional Chern insulator, and k-space dual of 
gravity (20, 22-34). Another interesting fu- 
ture direction is to explore the nonlinear re- 
sponses in canted AFM materials [section 
V.3 of (19)], where nonzero Berry curvature 
of higher order in magnetization has recent- 
ly been observed (38, 46, 51). In terms of ma- 
terials, we show that, beyond “band structure 
engineering,” the van der Waals interfaces 
can engineer the properties of the wave func- 
tion ie., “quantum geometry engineering” (47). 
Our observations may enable the use of AFM 
spins to harvest electromagnetic energy and 
to realize self-powered AFM spintronic devices. 
An exciting future breakthrough would be to 
demonstrate room-temperature wireless rectifi- 
cation based on the quantum metric NHE in a 
PT-symmetric AFM material. 
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CHATGPT complete two occupation-specific, ee Se 


Experimental evidence on the productivity effects 
of generative artificial intelligence 


Shakked Noy* and Whitney Zhang 


We examined the productivity effects of a generative artificial intelligence (Al) technology, the 
assistive chatbot ChatGPT, in the context of midlevel professional writing tasks. In a 

preregistered online experiment, we assigned occupation-specific, incentivized writing tasks to 

453 college-educated professionals and randomly exposed half of them to ChatGPT. Our results 
show that ChatGPT substantially raised productivity: The average time taken decreased by 40% and 
output quality rose by 18%. Inequality between workers decreased, and concern and excitement 
about Al temporarily rose. Workers exposed to ChatGPT during the experiment were 2 times as 
likely to report using it in their real job 2 weeks after the experiment and 1.6 times as likely 


2 months after the experiment. 


ecent advances in generative artificial 

intelligence (AI) may have widespread 

implications for production and labor 

markets. Generative AI systems such 

as ChatGPT or DALL-E, which can be 
prompted to create new text or visual outputs 
from large amounts of training data, are qual- 
itatively unlike most historical examples of 
automation technologies. Previous waves of 
automation predominantly affected “routine” 
tasks consisting of explicit sequences of steps 
that could be easily codified and programmed 
into a machine or computer, such as assembly- 
line manufacturing tasks or bookkeeping tasks 
(/, 2). By contrast, creative, difficult-to-codify 
tasks such as writing and image generation 
avoided automation, a pattern scholars have 
noted might change with the advent of deep 
learning, which now underpins generative 
AI systems. 

The emergence of powerful generative AI 
technologies reintroduces a host of classic 
questions in a new context (3-5). Automation 
technologies by definition perform specific 
tasks in place of humans. But more broadly, 
these technologies may either displace hu- 
mans completely from certain occupations or 
augment existing human workers by increas- 
ing their productivity (6-9). If automation 
technologies such as industrial robots mostly 
displace human workers, they can increase 
unemployment. Moreover, their impacts on 
aggregate productivity may be small or non- 
existent to the degree that they mainly serve to 
redistribute income previously earned by dis- 
placed workers to the capital owners supplying 
their robot replacements (0). If automation 
technologies such as computers augment ex- 
isting workers, they can simultaneously bene- 
fit workers, capital owners, and consumers by 
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raising wages, boosting productivity, and low- 
ering prices (J1-13). 

A potent generative writing tool such as 
ChatGPT could conceivably either displace or 
augment human labor. ChatGPT could en- 
tirely replace certain kinds of writers, such 
as grant writers or marketers, by letting com- 
panies directly automate the creation of grant 
applications and press releases with minimal 
human oversight. Alternatively, instead of dis- 
placing workers, ChatGPT could substantially 
raise the productivity of grant writers and mar- 
keters, for example, by automating relatively 
routine, time-consuming subcomponents of 
their writing tasks, such as translating ideas 
into a rough draft. In this case, these services 
would become cheaper and demand could ex- 
pand, resulting in higher employment and 
greater productivity for companies, cheaper 
products for consumers, and potentially higher 
wages for workers (/4). Furthermore, inequal- 
ities between workers could either decrease if 
lower-ability workers are supported more by 
ChatGPT or increase if higher-ability workers 
have the skills necessary to take advantage of 
the new technology. 

Which of these eventualities will generative 
AI systems bring about? The answer depends 
on a host of research questions (RQs). RQ]: How 
does access to generative AI systems affect work- 
ers’ productivity in existing tasks? Do workers 
choose to use these systems? Conditional on 
using these systems, how do workers interact 
with them and how do they affect productivity 
(15-18)? RQ2: Do these systems differentially 
affect low- and high-ability workers? RQ3: How 
do workers subjectively react to these technol- 
ogies (19)? 


Methods 


This paper took the first step toward answer- 
ing these questions (20). In a preregistered 
online experiment, we recruited 453 experi- 
enced, college-educated professionals on the 
survey platform Prolific and assigned each to 


ized writing tasks (27). The experiment 1. 
place from 27 January to 21 February 2023 
and involved GPT-3.5. The occupations that 
we drew from were marketers, grant writers, 
consultants, data analysts, human resource 
professionals, and managers. The tasks, which 
included writing press releases, short reports, 
analysis plans, and delicate emails, comprised 
20- to 30-min assignments designed to resem- 
ble real tasks performed in these occupations. 
Indeed, most of our participants reported 
completing similar tasks before and rated the 
assigned tasks as realistic representations of 
their everyday work (see the supplementary 
materials). 

Participants faced high-powered incentives in 
the form of large bonus payments to produce 
high-quality work. They received a base pay- 
ment of $10 plus up to $14 in bonus payments 
for output quality, with the average overall rate _ 
of $17/hour substantially exceeding the Prolific 
standard of $12/hour. We cross-randomized 
the structure of bonus payments faced by par- 
ticipants to show the robustness of our results 
to different incentive schemes (see below for 
more details). Output quality was assessed by 
blinded experienced professionals working in 
the same occupations. Evaluators were asked 
to treat the output as if encountered in a work 
setting and were incentivized to grade outputs 
carefully on a scale of 1 to 7 (22). Each piece of 
output was seen by three evaluators, with an 
average within-essay cross-evaluator correla- 
tion of 0.44 (23). 

We randomly assigned 50% of participants 
to the treatment group and 50% to the control 
group. The treatment group was instructed to 
register for ChatGPT between the first and 
second task, received guidance on using it, and 
were told they were permitted to use it on the 
second task if they found it useful. The control 
group was instead instructed to register for 
the LaTeX editor Overleafin an attempt to hold 
the time and hassle costs of signup constant 
between the two groups. The control group 
was not told they could use Overleaf on the 
second task and <5% of participants subse- 
quently reported using it. 

In addition to output quality evaluations, 
we collected self-reported and objective mea- 
sures of participants’ time spent on the tasks 
and took snapshots of participants’ outputs 
each minute while they performed the task to 
construct objective measures of activity and to 
detect ChatGPT usage (see the supplementary 
materials). 

A complete description of our experimental 
design, copies of relevant survey questionnaires, 
and additional figures validating our central 
measures and extending our main results are 
included in the supplementary materials. De- 
scriptive statistics about the sample, as well 
as balance and selective attrition tests, are 
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available in Table 1. The attrition rate was 
6% in the control group and 11% in the treat- 
ment group. Balance tests indicate that across 
13 pretreatment characteristics, the treatment 
and control groups exhibited a small but sig- 
nificant difference only for only two character- 
istics: employment status and being a human 
resources professional. Our partly within- 
person design, which controls for performance 
on the pretreatment task, should eliminate any 
influence of selective attrition on our results. 
In the supplementary materials, we also re- 
port Lee bounds (24) on our main results and 
versions of our results controlling for employ- 
ment status and occupation, which confirmed 
that our results are highly robust to selective 
attrition. 


Results 
Take-up of ChatGPT 


In the treatment group, 92% of treated par- 
ticipants successfully registered for ChatGPT, 
and 80% chose to use it on the second task (25). 
Users gave it an average self-assessed useful- 
ness score of 4.4 out of 5. 

Before treatment, 70% of our participants 
had heard of ChatGPT and 32% had used it 
before. Self-reported and objective measures 
indicated that ~10 to 20% of the control group 
used ChatGPT on the tasks (see the supple- 
mentary materials), meaning there was at 
least a 60-percentage point experimentally 
induced gap in usage between our treatment 
and control groups on the second task. Our 
estimates reflect the effects of ChatGPT on 
the average productivity of the 60 to 70% of 
participants whose usage was determined by 
their treatment assignment, and constitute 
lower bounds on the effects of ChatGPT usage 
on productivity. In the supplementary mate- 
rials, we report two-stage least-squares results 


adjusting our estimates upward for imperfect 
compliance. 


Productivity 


We first show results for our two productivity 
measures: time taken and evaluator grades 
(Fig. 1). The experimental intervention shifted 
both outcomes substantially. In the treatment 
group, time taken on the posttreatment task 
dropped by 11 min (0.75 SDs) relative to the 
control group, who took an average of 27 min 
(P < 0.001). Average evaluator grades in the 
treatment group increased by 0.45 SDs (P < 
0.001), with similar increases for overall grades 
and specific grades for writing quality, content 
quality, and originality. 

These effects are not limited to specific 
pockets of the time or grade distributions. As 
shown in Fig. 1, C and D, the entire time dis- 
tribution shifted to the left (faster work) and 
the entire grade distribution shifted to the 
right (higher quality). At the individual worker 
level, as shown in Fig. 2, treated workers who 
received a low grade on the first task experi- 
enced both 1- to 2-point increases in grades 
and 10-min decreases in time spent, whereas 
workers who received a high grade maintained 
their grade level while also reducing their time 
spent by ~10 min. 

These results are virtually identical across 
our two main incentive schemes, which cov- 
ered 80% of respondents: a “linear” scheme 
in which respondents were paid $1 for each 
point they received on each submission (each 
of which was graded on a 1- to 7-point scale), 
and a “convex” scheme in which respondents 
were additionally paid $3 for earning a grade 
of 6 or 7. The results shown in Fig. 1 are based 
on these two incentive schemes. The fact 
that treated participants reduced their time 
spent by a similar amount even when faced 


with strong incentives to produce high-quality 
output (under the convex scheme) demon- 
strates that the time-saving effects of ChatGPT 
are not specific to linear payment regimes and 
apply robustly across incentive structures. 

In our third incentive arm involving 20% 
of participants, we required participants to 
spend exactly 15 min on each task, thereby 
holding effort fixed across the treatment and 
control groups and allowing us to interpret 
any difference in grades as a pure effect of 
ChatGPT access on productive capacity. In 
this arm, the treatment increased grades by 
a similar albeit not statistically significant 
0.33 SDs (26). 

In an additional intervention, after com- 
pleting the second task, 30% of the treatment 
group were shown their first-task human- 
created output and given the opportunity to 
edit or replace it using ChatGPT. Of these 
participants, 19% chose to replace their re- | 
sponse with ChatGPT’s output and another 
17% used ChatGPT to edit their original re- 
sponse, suggesting that participants viewed 
ChatGPT as a means to improve output qual- 
ity as well as save time. 


Productivity inequality 


The control group exhibited persistent pro- 
ductivity inequality: Participants who scored 
well on the first task also tended to score well 
on the second task. As Fig. 2A shows, there 
was a correlation of 0.41 (P < 0.001) between 
a control participant’s grade on the first task 
and their grade on the second task, holding 
the evaluator constant. 

In the treatment group, initial inequalities 
were more than half-erased by the treat- 
ment: The correlation between first-task and 
second-task grades was only 0.14 (P < 0.001 
for the difference in slopes). This reduction 


Table 1. Descriptive statistics. 


Variable 


Annual salary in main job ($) 


n: data analyst 


n: manager 


t (task 1, min) 


Self-efficacy (task 1, scale of 1 to 10) 


N (control) 


Mean (control) 


234 67,764 2 


N (treatment) 


Difference 
AV73 


Mean (treatment) 
71,938 


This table presents descriptive statistics for our sample. We recode salary reports of >$500,000 to missing (affects two observations). “Employed” includes full-time and part-time employment. 


SP Ut P= 05: 
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A Time Taken Decreases 


B Average Grades Increase 
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Fig. 1. Treatment effects on productivity. Effects shown are restricted to 
the linear and convex incentive groups. (A) and (B) Means (and 95% 
confidence intervals for those means) of self-reported time taken and average 
grades in the first and second task separately in the treatment and control 
groups. The results look very similar for the objective measure of time 

active (see the supplementary materials). Also shown are treatment effect 
coefficients and 95% confidence intervals, rescaled to be in terms of pretreatment 
SDs of the outcome variable. The coefficients are estimated from regressions 


in inequality was driven by the fact that par- 
ticipants who scored lower on the first task 
benefited more from ChatGPT access. As Fig. 
2A shows, the gap between treatment and 
control is much larger at the left end of the 
& axis. 


Human-machine interactions 


What kinds of human-machine interactions 
underlie the productivity results documented 
above? Did workers paste the task prompt into 
ChatGPT and immediately submit its output, 
minimizing their time spent and increasing 
their grades because ChatGPT’s writing abilities 
exceeded theirs? Or did they treat ChatGPT as 
a helpful but imperfect tool, for example, using 
it to create a rough draft and then spending 
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time editing and improving the draft or using 
it to brainstorm or edit? 

Our evidence supports the first possibil- 
ity. Almost everyone submitted lightly ed- 
ited or unedited ChatGPT output, and we 
observed small time expenditures on edit- 
ing and no resulting improvement in re- 
spondents’ grades. In the treatment group, 
33% of participants reported submitting 
ChatGPT’s initial output without editing it, 
and 53% reported editing before submit- 
ting. However, those who reported editing 
were active on the task for only 3.3 min on 
average after we first observed them pasting 
in a large quantity of text (presumably from 
ChatGPT), with most active for 0 to 2 min 
(27). Qualitative examination suggests that 


of the within-participant change 
on a treatment dummy, occupation*task order fixed effects, and incentive 

arm fixed effects. In (A), this is at the participant level and SEs are hetero- 
skedasticity robust. In (B), this is at the participant-evaluator level, the 
regression also includes grader fixed effects, and SEs are clustered at the 
participant level. (© and D) Raw graphs of the outcome distribution in the 
treatment versus control group on the second task; (C) is at the participant level 
and (D) is at the participant-evaluator level. 


3 4 5 
Second-Task Grade 


in outcome from before to after treatment 


most of this editing was superficial, such as 
changing a placeholder or rearranging a sen- 
tence. Evaluator grades also suggest that 
this editing was ineffectual. There was no 
correlation between how long a participant 
was active after pasting in the ChatGPT text 
and the grade they ultimately received, and 
treated respondents who used ChatGPT did not 
receive higher average grades than the raw 
ChatGPT output that we gave to evaluators to 
grade (see the supplementary materials). 

It is not obvious whether these dynamics 
should be interpreted as evidence that ChatGPT 
will displace human workers or evidence that 
it will augment them. Although ChatGPT di- 
rectly substituted for participants’ effort with 
little need for human input, it also enabled 
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Fig. 2. Effects on grades 
and time across the 


A Grade Inequality Decreases 


initial grade distribution. 64 
Participant-evaluator 
observations are binned 

together according to 54 
the task 1 grade given to 
this participant by this 
evaluator. (A and B) Within 
each bin, the panels plot 
the average task 2 grade 
(A) or task 2 time taken oc 
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latter results were cal- 
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culated from a participant- 
evaluator level regression 
of the outcome variable on 
the task 1 grade, treatment 
status, treatment*task 

1 grade, and grader 

fixed effects, clustering 
SEs at the participant 
level. The control group 
slope is the coefficient 
on the task 1 grade, 

and the difference in 
slopes is the coefficient 
on the treatment*task 

1 grade interaction. 

Note that this difference 
in slopes will not match 


Time Taken: Task 2 


--8-- Control 


4 
Grade: Task | 


—+— Treatment 


up exactly with the difference between the raw slopes plotted in the graph because these raw slopes do not 


use grader fixed effects. 


participants to complete tasks much faster. 
We reflect on this further in the Discussion. 


Subjective outcomes: Job satisfaction, 
self-efficacy, and beliefs about automation 


Many of our treated participants had never 
heard of (30%) or never used (68%) ChatGPT 
before participating in the experiment. We 
used a battery of questions to assess their sub- 
jective reactions to encountering the technol- 
ogy. As depicted in Fig. 3, participants enjoyed 
the tasks more by 0.47 SDs when given access 
to ChatGPT (P < 0.001). Treated participants’ 
concern for (P < 0.01) and excitement about 
(P < 0.001) future effects of AI on their oc- 
cupations rose, and their overall optimism 
increased by 0.2 SDs (P < 0.05). These effects 
disappeared in the 2-week and 2-month follow- 
up surveys, indicating that they are best in- 
terpreted as short-run phenomena reflecting 
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respondents’ first experiences with the tech- 
nology (28). 


Two-week and 2-month follow-up surveys 


One powerful indication of the value of ChatGPT 
to participants is whether they continued to 
use it after the experiment, in their actual 
jobs. To track this, we resurveyed participants 
2 weeks and 2 months after their completion 
of the initial survey, with response rates of 
92% and 83%, respectively, and no treatment- 
control imbalance in response rates. 

In the 2-week follow-up, 34% of former treat- 
ment group participants reported using ChatGPT 
in their job in the past week, compared with 
18% of control group participants (P < 0.001). 
This large gap in usage fully persisted into the 
2-month follow-up, when 42% of the treat- 
ment group and 27% of control respondents 
reported using ChatGPT in their jobs in the 


past week (P < 0.01). The persistence of this gap 
suggests that the dissemination of ChatGPT 
into real professional activity is still in very 
early stages, with usage held back by a lack 
of knowledge about or experience with the 
technology. 

In the 2-week follow-up, ChatGPT users 
gave the technology an average usefulness score 
of 3.66 out of 5.00, somewhat lower than in our 
main experiment, likely because of the greater 
length and complexity of real-world tasks. The 
participants reported using it for a broad range 
of tasks such as generating recommendation 
letters for employees, responding to customer 
service requests, brainstorming, rough-drafting 
emails, and editing. 

Nonusers were divided into three roughly 
equal-sized groups, reporting that: (i) ChatGPT 
was not useful in their job, (ii) they did not 
know about it or did not have an account, or 


(iii) it was not allowed in their workplace or | 


was usually unavailable during the day. The 
one-third of nonusers who claimed that it was 
not useful in their job mostly said that this 
was because the chatbot lacks context-specific 
knowledge that forms an important part of 
their writing. For example, they reported that 
their writing was “very specifically tailored to 
[their] customers and involves real time in- 
formation” or “unique [and] specific to [their] 
company products.” 


Discussion 


College-educated professionals performing 
midlevel professional writing tasks substan- 
tially increased their productivity when given 
access to ChatGPT. The generative writing 
tool increased the output quality of low- 
ability workers and reduced time spent on 
tasks for workers of all ability levels. At the 
aggregate level, ChatGPT reduced inequal- 
ity. It is already being used by many workers 
in their real jobs. 

These results are consistent with other 
studies showing productivity-enhancing and 
equalizing effects of recent AI technologies 
(8, 15, 16, 18). Relative to these studies, we 
analyzed productivity effects across several 
occupations and tasks, examined how workers 
use ChatGPT, measured subjective reactions 
to the technology, and documented persistent 
effects of our treatment on ChatGPT usage in 
real jobs. 


Limitations 


The experiment had several important limita- 
tions. We examined a limited range of occupa- 
tions and tasks in which ChatGPT may be 
unusually useful. The tasks demanded clear, 
persuasive, relatively generic writing, which 
are arguably ChatGPT’s central strengths. They 
did not require context-specific knowledge 
or precise factual accuracy. The version of 
ChatGPT used in this experiment cannot, by its 
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Fig. 3. Job satisfaction, self- 
efficacy, and beliefs about auto- 


A Job Satisfaction Increases B 


Self-Efficacy Marginally Increases 


mation. (A) and (B) Job satisfaction 44 
and self-efficacy (originally elicited 
on scales of 1 to 10, normalized to 
have mean = 0 and SD = 1) before 
and after treatment in the treat- 
ment and control groups. Dots are 
means and error bars are 95% 
confidence intervals for means. Also 
shown are the coefficients on the 
treatment effect of a regression 
specified as in Fig. 1A. (€) Cross- 
sectional comparison of beliefs 44 


Job Satisfaction (SDs) 
o 


Treatment Effect: 


—4— Treated 
--G-- Control -.27 


0.47 SDs 
95% CI: [0.29, 0.65] 


Self-Efficacy (SDs) 
Oo 


Treatment Effect: 0.19 SDs 
95% CI: [-0.01, 0.39] 


—4— Treated 
--G-- Control 


about automation in the treatment 
and control group, all on a scale 
of 1 to 10. The first question was 
“How worried are you about 
workers in your occupation being 
eplaced by Al?” The second was 
“How optimistic are you that Al 
may make workers in your occupa- 
tion more productive?” The third 
was “How do you feel about the 
impacts of future advances in 

Al (where 1 = very pessimistic and 
0 = very optimistic).” 


Average Response (1-10 Scale) 


nature, access or supply context-specific knowl- 
edge and is not a reliable source of precise fac- 
tual information. 

The tasks could also be described through 
short, self-contained prompts, making use of 
ChatGPT easy, whereas many real-world tasks 
involve vaguer objectives and instructions, re- 
quiring workers to exercise initiative in deter- 
mining what to do. Finally, participants in 
our tasks faced direct incentives in the form of 
bonus payments scaling with output quality, 
which encouraged them to maximize gener- 
ic output quality and minimize time spent. 
White-collar workers are typically incentivized 
through longer-run promotion and firing incen- 
tives, which might instead encourage conspicu- 
ous exertion of effort or the development of a 
consistent personal style, both of which make 
ChatGPT less useful. 

The tasks and incentive schemes were cho- 
sen to meet the constraints of the experimen- 
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Pre-Treatment 


; 
Post-Treatment 


Pre-Treatment Post-Treatment 


C Effects on Three Beliefs About Automation 


diff: 0.266 SDs | 
p-value: 0.005 


diff: 0.379 SDs 
p-value: 0.000 


diff: 0.206 SDs 
p-value: 0.029 


— 


Worried About Excited About Overall 
Replacement Enhancement Optimism 
Control [EEE Treatment 


tal design. We required short tasks that could 
be explicitly described for and performed by 
a range of anonymous workers online, and 
we needed to incentivize serious effort. Our 
judgment was that building factual accuracy 
requirements into the tasks would either re- 
sult in tasks that felt artificial and unnatural 
(e.g., requiring participants to Google and re- 
port one or two specific facts) or overwhelm 
our budget for evaluators (e.g., giving par- 
ticipants an open-ended research task and 
exhaustively fact checking their assertions). 

The aforementioned factors limit but do not 
eliminate the generalizability of our results. In 
real-world tasks, the need to fact-check ChatGPT’s 
output will reduce its time-saving benefits, 
but the speed and writing quality increases 
observed in our experiment are sufficiently 
large that we suspect that ChatGPT will still 
often be useful. Moreover, newer versions of 
ChatGPT are more consistently factually ac- 


curate, and some versions can access the in- 
ternet to fact check themselves. We speculate 


workers may find iterative rounds of prompt- 
ing and discussion with ChatGPT useful even 
if they cannot immediately prompt out a final 
product. In these contexts, ChatGPT and human 
workers may be more strongly complemen- 
tary than in our experiment. The importance 
of context-specific knowledge will also limit 
ChatGPT’s utility, but there are plausible 
work-arounds. ChatGPT can be instructed to 
incorporate lists of context-specific factors, 
and organizations may be able to build cus- 
tomized ChatGPT-like models. Our follow-up 
surveys show that many workers do find 
ChatGPT useful in their real jobs. 

Overall, we speculate that, relative to our 
experimental findings, the direct productivity 
effects of ChatGPT in the real economy will be 
somewhat lower and the technology will be 
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more strongly complementary to human work- 
ers. To what extent either of these is true re- 
mains an open question. 


Implications 


Our experiment captured only direct, immediate 
effects of ChatGPT on worker productivity. We 
could not examine the complex labor market 
dynamics that will arise as firms and workers 
adapt to ChatGPT. Several factors will mediate 
how the direct productivity impacts of ChatGPT 
affect wages and employment in exposed oc- 
cupations. The first is the degree to which 
demand for goods produced by ChatGPT could 
expand as ChatGPT-fueled productivity in- 
creases make those goods much cheaper. For 
example, demand for programming services 
could plausibly expand massively if the price 
of those services fell. Aggregate programming 
employment might consequently increase. It 
is less clear whether demand for advertising or 
communication could expand as much, poten- 
tially entailing a reduction of employment in 
those sectors as fewer workers are needed to 
meet the same static demand. As an additional 
complication, ChatGPT might directly affect 
the composition of demand. For example, be- 
fore ChatGPT, a piece of writing signaled that 
a company had invested at least some human 
labor, thought, and judgment into a message, 
which consumers might have appreciated; 
with this no longer being the case, demand 
for these messages could decrease (29). 

The second factor is the nature and scarcity 
of the human skills best complemented by 
ChatGPT. Consider, for example, the use of 
ChatGPT to produce advertising content. Is 
this best accomplished by one senior adver- 
tising manager directly providing high-level 
guidance to ChatGPT or by 10 junior adver- 
tisers carefully designing prompts and editing 
ChatGPT’s output? The answer will determine 
the structure of employment in the advertising 
sector. Similarly, suppose ChatGPT is highly 
complementary to human labor in program- 
ming tasks. If ChatGPT’s human copilot needs 
to be an expert programmer capable of di- 
rectly proofreading its output, then this could 
raise programmers’ wages by boosting their 
productivity while their expertise remains scarce. 
If, by contrast, the complementary human role 
requires only basic programming knowledge 
and mainly involves checking output and re- 
fining natural-language prompts, then the pool 
of potential programmers would vastly in- 
crease and wages could fall even as produc- 
tivity rises. More generally, tools such as ChatGPT 
could make expertise more accessible by facil- 
itating learning (30). 

Finally, the diffusion and effects of ChatGPT 
will also depend on organizational consider- 
ations that our experiment treating isolated 
individual workers does not address. ChatGPT 
might interact with traditional promotion and 
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hiring systems based partly on conspicuous 
exertion of effort. Large language models may 
be used to monitor or evaluate workers and 
avoid paying higher wages (37). Organiza- 
tional and societal norms around the accept- 
ability of using tools such as ChatGPT may 
take time to cohere and may significantly 
affect adoption of the technology (32-35). 

Overall, the arrival of ChatGPT ushers in an 
era of vast uncertainty about the economic 
and labor market effects of AI technologies 
(36-38). Our experiment takes the first step 
toward answering the many questions that 
have arisen. 
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QUANTUM OPTICS 


Quantum vortices of strongly interacting photons 


Lee Drori}, Bankim Chandra Das{, Tomer Danino Zohar, Gal Winer, Eilon Poem, 


Alexander Poddubny, Ofer Firstenberg* 


Vortices are topologically nontrivial defects that generally originate from nonlinear field dynamics. 
All-optical generation of photonic vortices—phase singularities of the electromagnetic field—requires 
sufficiently strong nonlinearity that is typically achieved in the classical optics regime. We report 

on the realization of quantum vortices of photons that result from a strong photon-photon interaction 
in a quantum nonlinear optical medium. The interaction causes faster phase accumulation for 
copropagating photons, producing a quantum vortex-antivortex pair within the two-photon wave 
function. For three photons, the formation of vortex lines and a central vortex ring confirms the 
existence of a genuine three-photon interaction. The wave function topology, governed by two- and 
three-photon bound states, imposes a conditional phase shift of x per photon, a potential resource 


for deterministic quantum logic operations. 


hotons do not interact with one another 

in the optical regime. An effective inter- 

action between photons can be medi- 

ated by matter, but, in conventional 

optical nonlinear media, this interac- 
tion is negligible on the level of individual 
photons. It is only at the ultimate limit of quan- 
tum nonlinear optics, realized in specially en- 
gineered systems, that a single photon can 
alter the optical response of the system, ren- 
dering an appreciable photon-photon inter- 
action (J-4). 

The realization of strong photon-photon in- 
teractions in atomic ensembles can enable de- 
terministic quantum-information processing 
for optical qubits, such as single-photon tran- 
sistors and phase gates (5-7). It can be used 
to generate nonclassical states of light, such 
as squeezed, cluster, and repeater states, as a 
resource for quantum communication, com- 
putation, and sensing (8, 9). It can also mani- 
fest in interacting quantum gases and fluids 
of photons, allowing the exploration of exotic 
many-body physics for light (10-13). 

We realize an extreme regime of quantum 
nonlinear optics and observe optical quantum 
vortices generated by the interaction between 
only two or three photons. Quantum vortices— 
phase singularities of the wave function—are 
typically expected in strongly interacting sys- 
tems of many particles, such as nonlinear op- 
tical systems (J4, 15), superfluids (16-19), and 
hybrid light-matter systems (20-24). In this 
work, we induce a strong, effective interaction 
at the few-photon level by coupling photons to 
Rydberg atoms in an ultracold rubidium gas 
(Fig. 1, A and B). 

The photons propagate through the me- 
dium as light-matter polaritons, at 10~° times the 
speed of light. Because of the strong van der Waals 
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coupling between the Rydberg atoms, photons 
that propagate close to each other—at a dis- 
tance smaller than the so-called blockade radius 
Tp—experience a local change in the refractive 
index. For two photons, the optical field can be 
described by a wave function w(@, 2), which 
is the probability amplitude of having photons 
at coordinates 2, and 2, along the medium (25). 
The change in refractive index at |v — #| < Tp 
causes the accumulation of excess local phase 
in y(@, 2X), which, under the correct conditions, 
produces vortices in w(a%, %2). 

To understand the vortex formation, we con- 
sider a simplified analytical model describing 
the evolution of w(%, 2) in a medium of uni- 
form density and length L using the two-photon 
Schrédinger equation (25, 26): 


: 1 
tay =-5—Aw+V (ry (1) 


Here, 7 = 2 — 22 is the relative coordinate of the 
two photons, and the center-of-mass coordi- 
nate along the medium R = (a + @ + L)/2 = 0... 
L plays the role of time. The incoming, co- 
herent probe field dictates homogeneous initial 
conditions w(7, R = 0) = 1. The photon-photon 
interaction is described by a nearly square 
potential V(r) = U/(1 + r°/7,%), where U con- 
stitutes the change in the refraction index due 
to the Rydberg blockade, and the (negative) 
effective mass m = —U/8 arises from single- 
photon dispersion. Because mU < 0, Eq. 1 de- 
scribes attraction between photons (25). 

A nontrivial prediction of Eq. 1 is the for- 
mation of vortex-antivortex pairs in the me- 
dium, symmetrically around the potential well 
(Fig. 1C). Equation 1 has localized and extended 
eigenstates, denoted as bound and scattering 
two-photon states (25). Considering a single 
bound state Wpouna(7) with energy E, which is 
the case in our experiment, the initial scatter- 
ing component is Wecat = 1 - Woouna(7). In the 
crudest approximation, the phase accumula- 
tion of the scattering component can be ne- 


q 


glected (27), and the solution to Eq. 1 reduc} ed 


w(7, RB) = ey uma(T) + Vecat (2) 
The phase vortices are formed around y = 0 
when the two terms—the bound and scatter- 
ing components—cancel each other. This re- 
sult is generic and occurs for a sufficiently long 
medium and for any number of bound states. 
Such an interference mechanism for vortex- 
antivortex pair formation is universal and 
known from acoustics (28) and atomic col- 
lisions (29) to linear (30, 37) and nonlinear (32) 
optics. However, such vortex-antivortex pair 
formation has not been observed before in 
quantum optics because the required strong 
and prolonged photon-photon interaction had 
been unattainable. 

We quantify the duration and strength of 
the interaction by the dimensionless parame- 
ters p = UL/2 and 4 = 2|Um|rzZ (83), respec- 
tively. For vortices to develop, ~A should be 
large. When the interaction is weak, that is, for 
X « 1, we find the condition (A > @o, with the 
threshold phase @o ~ 0.94. For moderate in- 
teractions i = 1, the condition is ~ > @o [see 
section $2.4 in (27)]. These conditions are 
illustrated by solid lines in the “phase diagram” ‘ 
in Fig. 1D, and they compare reasonably well 
with the domain of vortex formation calculated 
numerically for our experimental, nonuniform 
medium (green shading). ‘ 

In Rydberg polariton systems, the essen- 
tial experimental parameters are the total 
optical depth OD of the medium (e~°” being 
the on-resonance transmission) and the optical 
depth of the blockade range OD, = OD-7)/L. 
These parameters govern ~ ~ OD and A ~ 
OD,,”. In our experiment, we reach OD = 110 
and OD, ~ 22, setting @ ~ 2.7m and A = 3 
(blue diamond in Fig. 1D). The moderate inter- 
action obtained in previous experiments (~ = 
1.5n and i « 1), albeit being enough to ob- ‘ 
serve the signatures of photonic bound states 
(25, 34, 35), were well below the threshold 
for vortex formation. ‘ 


Observation of two-photon vortices 


Our experiment starts by compressing and 
trapping an ultracold rubidium cloud with a 
Gaussian density profile pye~”’/?” centered at 
x = 0 and effective length L = /2no0 ~ 75 um 
along the optical axis 2. Our maximal peak 
density po = 5 x 10” atoms/cm’ is a factor of 
three to six higher than that in previous ex- 
periments and is the primary source of our 
large i 9° Po We form Rydberg polaritons 
using counter-propagating probe and con- 
trol fields, which together resonantly couple 
the atomic 5S ground level to the 100S Rydberg 
level via the 5P level (Fig. 1B). We send, on av- 
erage, f ~ 0.25 probe photons per microsecond. 

The probe photons experience the three- 
level optical susceptibility, except within the 
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J 


control 


0 


probe 


Fig. 1. Setup and conditions for generating photon 


mediated quantum nonlinear optics. (A) Counter-propagating probe (red) and 
control (blue) fields are focused onto an elongated ultracold atom 
absorption image), held in a crossed optical trap (green). After traversing the 


medium, the probe light is split and measured by four 
to D3 and D,. Detectors D; to D3 provide the two-pho 
correlation functions g and g. Detector Dy, which 
beat between the probe and reference light (LO, local 
with detectors D, to D3 to extract the conditional pha 


fig. S2). (B) Atomic-level structure for generating polaritons compri 


Rydberg orbital. The van der Waals interaction (vdW) 


atoms disturbs the propagation of two close polaritons, leading to local 


blockade range 7, ~ 15.3 um, where they 
experience the two-level (5S-5P) susceptibility. 
Ideally, we want the difference U between 
these susceptibilities to be purely real, render- 
ing a conservative photon-photon interaction. 
To this end, we first detune the probe from the 
atomic transition by A = 9.41, where T is the 
half-width of the 5P level, and then adjust 
the control frequency such that the transmis- 
sion remains the same outside and inside of the 
blockade range (25). This adjustment elimi- 
nates the residual nonconserving (dissipative) 
part of the interaction and amplifies the con- 
serving part U = (OD/L)(qI/A) by the factor 
q = 14 (27). The atomic density decreases dur- 
ing the experimental cycle by a factor of 5.5, 
allowing us to study the full range 110 = OD = 
20 (2.7m = ¢~ = 0.5m) and, correspondingly, 22 = 
OD, = 4.1 (8 = A = 0.1) in each experiment 
(dotted line in Fig. 1D). 

The vortex-antivortex pair forms in the two- 
photon wave function w(7, R) of the probe 
inside the medium. Although we cannot di- 
rectly measure these vortices within the medium, 
we experimentally determine their presence and 
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vortices in Rydberg- 
two-photon wave functi 
ic cloud (left, 
pair of two interacting 
single-photon detectors D; — shaded area, calculated 
ton and three-photon 
measures the interference 
oscillator), is correlated 
ses 6 and 6° (see also 
sing the 100S 
between two Rydberg 


structure from the correlations of the outgoing 
photons, essentially measuring (7, R = L) at 
the medium’s boundary. We track the depen- 
dence of y(7, R = L) on the optical depth OD, 
and by varying OD, we effectively vary the 
interaction duration. The dependence on OD 
at the boundary becomes a proxy for the evo- 
lution in the bulk. Whereas OD corresponds 
to R, the temporal separation t between the 
outgoing photons coarsely corresponds to their 
spatial separation r in the medium (36). Ex- 
perimentally, we measure the two-photon cor- 
relation function g(a) and conditional phase 
P(t) of the outgoing probe field for varying 
OD and associate them, respectively, with the 
squared amplitude and phase of (7, R = L). 
In the experimental results (Fig. 2, A and B), 
we first observe the gradual bunching of pho- 
tons, g(0) > 1, as the OD increases, accom- 
panied by depletion of g@(z) at |t| = 0.25 us 
[see cross section (i) in Fig. 2A]. The bunching 
and depletion are due to the effective attraction 
between the photons, which is governed by a 
two-photon bound state and accompanied by 
accumulation of conditional phase (0) < 0 


interaction strength A 


1008/2 


control 


20 
accumulated phase @ 


accumulation of excess propagation phase. (C) Amplitude and phase of the 

on, calculated numerically in the Schrédinger approxi- 
mation (Eq. 1, with U = V12/rp), showing the formation of a vortex-antivortex 
photons. (D) Conditions for vortex generation (green 
numerically for a realistic Gaussian cloud) in terms of the 
photon-photon interaction strength 4 and the accumulated interaction phase in 
the finite medium q. The solid curves are minimal conditions calculated 
analytically for a uniform cloud under the approximation of Schrodinger evolution 
(27). The dotted blue line indicates the available ~ and A in our experimental 
setup, peaking at the blue diamond. Red circles indicate the conditions 
achieved in (25, 34, 35). The inset image depicts the phase structure of a 
single vortex-antivortex pair. 


[see (i) in Fig. 2B] (25). At OD = 78, 60) 
reaches -n. A x conditional phase allows for 
deterministic, maximally entangling operation 
on photonic qubits (7), and here it is obtained 
for copropagating photons. 

The vortex-antivortex pair is formed around 
OD = 80, as manifested by the clockwise and 
counterclockwise phase twists in Fig. 2B (or- 
ange arrows). These phase twists correspond 
to steep steps of o@(z) at |t| ~ 0.25 us, whose 
direction flips when the OD increases [com- 
pare (ii) with (iii) in Fig. 2B]. Furthermore, 
as expected, the photons are depleted from the 
vortex core, where g(r) = 0.24 (global minima 
in Fig. 2A). Following the vortex-antivortex for- 
mation, 60) decreases gradually from x to 
zero, and the bunching and depletion in g 
get smaller. The vortices effectively “unwind 
the tension” between the regions of fast and 
slow accumulation of phase. 

Our experimental results are also corrobo- 
rated by numerical simulations based on the 
model described in (25, 36). The simulations 
use the actual experimental parameters, in- 
cluding the Gaussian density profile of the 
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Fig. 2. Two-photon A 
vortices. (A and B) 

Color maps show the 
experimental (A) two- 


g (data) 


photon correlation —100(' & 
g® and (B) two-photon Q Lint 
phase $° as a function Ss 30 

of the time between = 

photons + and the optical o) 

depth of the atomic = a 
medium OD. Enlarged s 40 
views (i) to (iii) explicitly Q 

show the measured = 40 


OFPNORN OFRN 


curves g(t) and o°>() 
for OD = 49, 78, and 95. 
A vortex-antivortex pair 
[orange arrows in (B)] is 
formed around OD = 80 
on the edge of the medium 
and is thus captured by 
the detectors as a phase 
step around |t| = 0.25 us 
and a phase wrap of 
6(t = 0) from -n to +n. 
At the vortices’ core, 

g® approaches zero. 

(C) Numerically calculated 
6 for the experimental 
conditions. (D and E) Nu- 
merically calculated 
phase of the stationary, 
space-dependent two- 
photon wave function, 
showing the two-photon 
vortices forming (D) deep Cc 
inside the medium and (E) 

on the edge. Dashed circles 
represent the edge (20) of 

the Gaussian cloud. 


—n —1/2 


100 


optical depth (OD) 


—n —1/2 


optical depth (OD) 


T (us) 


medium and the propagation of the second 
photon after the detection of the first photon. 
They are presented in Fig. 2C and are in excellent 
agreement with the measured data, reproduc- 
ing the vortex-antivortex pair and the surround- 
ing phase structure in full detail. The 10% 
discrepancy in the OD at which the vortices 
are observed is due to a slight saturation of our 
phase detection scheme and is completely 
resolved for a weaker incoming probe (see fig. 
S5). We use these simulations to calculate the 
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conditions for vortex formation in Fig. 1D (green 
shaded area). 

Moreover, the simulations are able not only 
to reproduce the experimentally accessible data 
but also to establish the dynamics of the two- 
photon wave function (2, #2) inside the me- 
dium. Figure 2, E and D, presents the phase of 
wW(2, &) for two values of OD. These simu- 
lations exhibit the entrance of phase singulari- 
ties and eventually the presence of a stationary 
pair of quantum vortices in the medium. 


Three-photon vortices 

The strong interaction, responsible for forming 
two-photon vortex-antivortex pairs, leads to an 
even richer topological structure of the three- 
photon wave function y(@, 2», #3). We will first 
illustrate this by considering only the pairwise 
photon interactions. If only two of the three 
photons are close together, they attract each 
other and form a quasi-bound two-photon state. 
Because the third photon remains unbound, the 
three pairs of vortices resulting from each of the 
three possible quasi-bound states would mani- 
fest as six vortex lines in the (2, 2», #3) space. 

To verify this prediction, we measured the 
three-photon phase (4, ty, tz) and the cor- 
responding correlation function Pt, to, ts) 
over a range of OD. Whereas the stationary 
two-photon correlation function depends on 
a single time separation ¢; — t2, characteriza- 
tion of the three-photon wave function requires 
two time separations. These are conveniently | 
parametrized by the Jacobi coordinates 
n=ty/V2 and ¢ = (t3 + t3)/V6, where 
ty = t; — t; (37). The three-photon measure- 
ments are then given by three-dimensional 
(3D) datasets for 6 and g that depend on 7, 
¢, and OD. 

Figure 3A shows $n, ¢) at the critical OD 
where vortices appear. We identify cores of 
vortex lines with m-phase dislocations: along 
the edges of the six-bar star (|) | = m region) 
and around the center (6 = 0 region). Figure 
3, B and C, shows the development of the phase 
along the lines € = 0 (uniformly separated 
photons) and 7 = 0 (a photon at some distance 
from a pair). Notably, 60, 0) varies mono- 
tonically with OD much faster than 60) 
does, as seen by comparing Fig. 3B with Fig. 2B. 

Eventually, the vortices structure is best 
understood from Fig. 3D, which shows the re- 
constructed 3D isosurfaces |Vo"°)| = 0.7 rad/us, 
with the phase o® indicated by the surface 
color. The starlike structure of tubes in Fig. 
3D corresponds to six vortex lines, in agree- 
ment with our prediction. We can addition- 
ally verify that this structure is due only to the 
two-photon attraction by constructing it from 
the measured two-photon data. This construc- 
tion, shown in fig. S4, exhibits a starlike vortex 
structure nearly identical to that found in the 
three-photon data. 

The vortex star structure can be explained 
analytically by generalizing Eq. 2 to three pho- 
tons. To this end, we replace Wrouna(7”) by a sum 
Wed = Wooua(72) + Vioma (713) + Viomal's2) of 
three quasi-bound states (7 = xv; - 2), respecting 
the bosonic (permutation) symmetry of the three- 
photon wave function. For simplicity, we assume 
a weakly bound state Wiguna(”) = 2e-2!"/¢ ina 
delta potential, where a ~ 7,/A is the scattering 
length (25, 38). As shown in Fig. 3E using the 
spatial Jacobi coordinates 1 = 72:/ /2 and 
C= ("3 + 723) / V6, the generalized Eq. 2 gives 
the six vortex lines. These results support our 
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Fig. 3. Three-photon vortex lines and vortex ring. (A to D) Three-photon con- 
ditional phase ¢©(n, ¢) measured at different OD, where 7 and ¢ are time 
separations between the photons in Jacobi coordinates. For each OD, the data are 
averaged by assuming a sixfold symmetry; an example of data before averaging is 
shown in the inset in (A). We present three cross sections: (A) OD = 79, (B) 

€ = 0, and (C) 7 = 0. In all these, the central feature originates from three-photon 
interactions. In (D), to reveal the vortex structure, we plot the measured 6 along 


interpretation that the vortex lines in the 
three-photon wave function are a direct gen- 
eralization of the two-photon vortices. 

The full three-photon data, however, show 
richer behavior, which cannot be reduced to 
just independent pairwise photon attraction. 
The effect of a third photon approaching a 
bound pair is reflected in the central feature in 
Fig. 3C and, more clearly, as a torus around 77 = 
¢ = Oin Fig. 3D, manifesting a vortex ring. This 
inner ring does not originate from two-photon 
quasi-bound states but rather from a genuine 
three-photon bound state ye ad (34, 37) that 
interferes with the scattering states. 

To describe the formation of the ring, we 
use the following ansatz, adding We na and its 
energy EF; to the generalized Eq. 2: 


w(n, 6,2 


)= eT hh 6) 
= (2-3) 
+e ae + VWecat 


(3) 
where Yecqe = 1 — Wea - Woon: As shown in 
(34, 39), the wave function of a three-photon weakly 
ig state near n = ¢ = 0 can be approximated 
bye a(n, G) o e-VSinl/ag—V 2h V36|/49—VAh+V86 
This form can be interpreted as a product of 
confined states, with three pairs attracting 
each other simultaneously. It is then straight- 
forward to check, as shown in Fig. 3F, that the 
interference between the first and last terms 
in Eq. 3 produces a vortex ring with the same 
topology that we observe experimentally. 

Another important result revealed by the 
6 data in Fig. 3 is that the vortex lines and 
ring appear approximately at the same value 
of OD = 80. This observation is far from 
obvious, allowing us to estimate the relation 
between the bound state energies F3 and £4. 
An analytical model with only pairwise in- 
teraction V predicts E3 = 4E> (40), which 
would result in the vortex ring appearing at 
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a lower OD. Fitting Eq. 3 to the experimental 
data yields E; = 3E», so that e~:* — —1 forms 
a vortex ring simultaneously with e “" = —1 
forming the vortex lines, as shown in Fig. 3F. 
The result |£3/E2| <4 is experimental evi- 
dence of a genuine three-photon repulsion 
term VO, ¢) that attenuates the pairwise 
photon attraction (37, 41). This attenuation 
follows from the physics of the Rydberg block- 
ade, where one photon can “saturate” the in- 
teraction and simultaneously block two (or 
more) other photons. 

The conditional (same-time) phase 0? = 
®) (0,0), which originates from the construc- 
tive sum of the vortex lines and ring, is o? = 
—2n. This nonzero value is evident from the 
monotonic increase of the phase versus OD in 
Fig. 3, B and C, or from the phase winding 
directions in Fig. 3D. Alternatively, one can 
unwrap the 6® data, as shown in fig. $3, to 
find -2x at the center of the ring. Impor- 
tantly, given that o? =) (0) —n, we find 
of Me 20° . This result deviates from the 
known ae say aa ini phase for n 
photons 6(”). = o2)_ n(n — 1)/2, which ascribes 
6). =—8n for 9 2) = =—n (42). Our result 
of) —— 26° ) agrees with the prediction for sat- 
urated interaction (34), again indicating the 
attenuation of the interaction by the block- 
ade saturation. 

The strong three-photon attraction in 
the 4 > 1 regime and the repulsive effect of the 
blockade saturation are also evident in the 
three-photon correlations g presented in 
Fig. 4. Whereas the six crests of g® > laway 
from the center are governed by the two-photon 
quasi-bound states, the structure at the center 
is substantially modified by the three-photon 
bound state. To see this, compare the mea- 
sured g® data in Fig. 4, A to C, with that 
expected from only the pairwise interactions 


¢ bey 


© es 
oS 


isosurfaces of |Vo?) | which are themselves derived from the data. We identify six 
vortex lines (tubes) and a central vortex ring (torus); note the 2x phase twist around 
the cross section of the tubes and torus (inset). (E and F) Analytic calculation 

of the three-photon phase arg[w(n, ¢, R)] using the approximate ansatz in Eq. 3 (E) 
ole and (F) with the three-photon bound-state term, for £3 = 3E> and 

-3we8 (0, 0). The outer vortex lines are due to the generalized two- 
photon bound states, whereas the vortex ring is due to the three-photon bound state. 


(ta) + § + g°?)(tg) — 2 in Fig. 
Z “D to s gi is the pa ne disconnected part 
of the correlations (37). It is evident in all cross 
sections that the central peak in g, reaching 
a maximal value of g0, 0) = 7.4: + 0.3, is twice 
as narrow as the corresponding peak in g®). 
This strong three-photon bunching has been 
characterized before and attributed to the 
nee confinement of Vo na compared with 
eae (34, 37). 

The effect of blockade saturation is best 
captured by the connected part of the correla- 
tions g. = g© - g, in Fig. 4, H and IL. We 
can interpret gO as a manifestation of the 
contribution of the three-photon correction 
term V®, which is repulsive. First, the repul- 
sion attenuates the bunching of simultaneous 
photons, as evident by the negative gO close 
to the center (blue in Fig. 4, G to I). We observe 
a minimal value ofg.”).., = —2.5 + 0.2. Second, 
as photons are less attracted to the center, we 
observe a ring of positive g& (at a radius 


¢ 


& 


4 


€ 


Vm+ CO = 0.25 us; red in Fig. 4G), reaching . 


gO ne = 0.9 + 0.1. As expected, with 4 > 1, both 


go and ol?) ae are an order of magnitude 
larger than the values obtained previously with 
1 «1 (34, 35, 43). Finally, g is modulated 
along that ring, exhibiting six peaks where the 
three photons are uniformly separated in time 
(e.g., along 77, for ¢ = 0). This, too, is due to the 
repulsive correction, which favors equally 
spaced photons over the asymmetric arrange- 
ment of one photon separated from a close 
pair (e.g., along ¢, for 7 = 0). For larger photon 
separations, the regularization effect of Vv 


diminishes, and gO approaches zero. 


Discussion and summary 


Our observation of topological defects—including 
quantum vortex-antivortex pairs, vortex lines, 
and vortex rings—in the few-body wave function 
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Fig. 4. Three-photon bound-state and blockade saturation. (A to C) Three-photon 
correlations gn, ¢, OD) (after sixfold averaging), sectioned at (A) OD = 69, (B) ¢ = 
0, and (C) 7 = 0. (D to F) Disconnected part of the correlations g4(n, ¢, OD), 
calculated from only the g(z, OD) data and representing the two-photon contribution 
to the three-photon correlations. The strong confinement of the three-photon 


of interacting photons is enabled predominantly 
by realizing a strong interaction between the 
photons. The interaction strength is quantified 
by the dimensionless parameter i, which is the 
ratio between the characteristic interaction and 
kinetic energies. Whereas in previous works 
this interaction strength was less than unity, 
our system reaches A = 3. 

The vortices in the n-photon wave function 
determine the maximal conditional phase rae 
achievable for copropagating photons. For two 
photons, the vortex-antivortex pair develops 
when bo” = —n. For three photons, these pairs 
connect to form six vortex lines surrounding a 
vortex ring, the sum of which gives bo x —20. 
The deviation from the value -3n expected for a 
quantum-Kerr nonlinearity is a manifestation 
of the blockade saturation. It demonstrates the 
role played by the vortices in controlling the 
conditional phase, with important consequences 
for deterministic quantum logic operations 
(44, 45) and other quantum nonlinear devices 
(46, 47) with three or more photons. 
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Our findings far from exhaust the potential 
of exploring many-body polariton physics. Even 
for relatively weak interactions, one could 
imagine exotic many-body bound states, such 
as a cascade of Efimov three-body states with 
a scaling-invariant spectrum. Although ini- 
tially proposed for 3D structures (48), Efimov 
states were later predicted to exist for a 
lower number of spatial dimensions (49, 50), 
including quasi-1D Rydberg atom clouds 
(13), but have not been observed thus far. Fur- 
ther, our observation of the genuine three- 
body interaction V“ could be key to the 
realization of four-body interactions, which 
are important for lattice simulations of gauge 
field theories (57). 

As opposed to real space, which is limited to 
three dimensions, the (2, %o, ...) space studied 
in this work can be generalized to more than 
three dimensions, where higher-dimensional 
topological structures, such as linked and knotted 
vortex loops, could be explored. This could 
provide an interesting connection to the stud- 


¢ (1s) 


bound state leads to a tighter bunching feature. (G to I) Connected part of the 
correlations g, = g® - g,©, reflecting the dynamics beyond pairwise interactions. 
These dynamics are regulated by the saturation of the blockade interaction, expressed 
as an effective, short-range repulsive force. The negative central feature (blue), the 
positive ring (red), and the modulation along this ring are all governed by this force. 


ies of nontrivial topological physics in syn- 
thetic dimensions (52, 53). It could further be 
interesting to examine whether and how the 
topological phase singularities of the n-photon 
wave function transform to vortices of the mean 
field in the nonlinear classical optics description 
atn > 1. 

Finally, a twofold increase in the atomic 
density will realize the regime A > x”, where 
higher-order two- and three-photon bound 
states should appear (26, 38). A superposition 
of vortex series arising from different bound 
states would drive complex nonperiodic dynam- 
ics of the few-photon wave function with an 
intricate phase structure. The three-photon 
vortex rings that we observe are a relatively 
simple example of how the phase of the n- 
photon wave function can be controlled by 
an additional (7 + 1) photon. Potentially, these 
topological phase structures of n-photon wave 
function could be harnessed to develop new 
multiphoton control tools and deterministic 
quantum logic. 
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Widespread regulatory specificities between 
transcriptional co-repressors and enhancers 


in Drosophila 


Jelle Jacobs?, Michaela Pagani’, Christoph Wenzl', Alexander Stark?2* 


Gene expression is controlled by the precise activation and repression of transcription. Repression is mediated 
by specialized transcription factors (TFs) that recruit co-repressors (CoRs) to silence transcription, even in 
the presence of activating cues. However, whether CoRs can dominantly silence all enhancers or display distinct 
specificities is unclear. In this work, we report that most enhancers in Drosophila can be repressed by only a 
subset of CoRs, and enhancers classified by CoR sensitivity show distinct chromatin features, function, TF 
motifs, and binding. Distinct TF motifs render enhancers more resistant or sensitive to specific CoRs, as we 
demonstrate by motif mutagenesis and addition. These CoR-enhancer compatibilities constitute an additional 
layer of regulatory specificity that allows differential regulation at close genomic distances and 

is indicative of distinct mechanisms of transcriptional repression. 


nimal development and homeostasis crit- 
ically depend on differential gene expres- 
sion (J, 2), which is enabled by the precise 
regulation of transcriptional activation 
and repression (3-8). Although repres- 
sion is often associated with heterochromatin 
(9, 10), genes can also be silenced in transcrip- 
tionally permissive euchromatin (17-14) by 
repressive transcription factors (TFs), also 
termed repressors, that bind to DNA and re- 
cruit co-repressors (CoRs). Because CoRs can 
suppress transcription even in the presence of 
activators (15, 16), this mode of gene silencing 
is termed active transcriptional repression 
(7, 18). Active repression is critical, and its fail- 
ure can cause developmental defects (19-21) 
and diseases such as cancer (72, 22). It is also 
conceptually intriguing because it requires the 
overriding of activating cues. However, the modes 
and mechanisms of this process are unclear. 

Given that transcriptional activation can oc- 
cur by means of distinct modes (23-25), it is 
intriguing to speculate whether distinct modes 
of active repression exist. Examples of specific- 
ities between CoRs and activators have indeed 
been observed. The CoR retinoblastoma pro- 
tein (Rb) can repress the TFs E2F, Mip120, 
and PU.1 but not others like SP-1 (26-28), and 
CoRs can have different activities in distinct 
transcriptional contexts (29). Comprehensive 
studies that define different modes of active 
repression and uncover their regulatory rules 
are, however, lacking. 

In this work, we determined the mutual 
compatibilities among five known CoRs and a 
genome-wide library of active enhancers by 
measuring enhancer-activity changes upon CoR 
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tethering in otherwise unperturbed cells, sim- 
ilar to activator-bypass experiments (23, 30). 
We reasoned that testing each enhancer with 
each CoR in all possible combinations should 
reveal CoR-enhancer combinations that lead 
to decreased enhancer activity (enhancers are 
sensitive) and those that do not (enhancers are 
resistant), indicative of compatible and incom- 
patible pairings. 


Results 
UAS-STARR-seq for genome-wide screening of 
enhancer-CoR specificity 


We modified the massively parallel enhancer- 
activity assay STARR-seq (self-transcribing 
active regulatory region sequencing) (31) to 
enable the function-based testing of genome- 
wide enhancer-candidate libraries with dif- 
ferent CoRs. Briefly, we introduced four 
upstream-activating-sequence (UAS) motifs 
immediately downstream of the enhancers, 
which leaves the enhancer sequence intact 
but allows for the direct tethering of selected 
CoRs through the Gal4 DNA binding domain 
(Fig. 1A) to assess whether the CoRs can over- 
ride existing activating cues, a process akin to 
active repression. 

We chose Drosophila S82 cells as a model 
system and five CoRs that represent different 
protein complexes, repressive pathways, and 
enzymatic functions (16, 32-34): CoRest, CtBP, 
Rbf, Rbf2, and Sin3A. CoRest, CtBP, Rbf, and 
Sin3A are conserved, essential, and broadly 
expressed, whereas Rbf2 has a more restricted 
expression pattern (35, 36). CoRest mainly re- 
presses neuronal genes, directly interacts with 
histones through its SANT domains, and asso- 
ciates with other conserved repressors, includ- 
ing the demethylase Su(var)3-3 (LSD) and the 
histone deacetylase Rdp3 (HDAC1/2) (32, 37). 
CtBP is recruited by developmental TFs through 
a short PXDLS (Pro-X-Asp-Leu-Ser, where X 


| 


q 


is an unspecified residue) peptide motif chee 


pdz 


sembles p-2-hydroxyacid dehydrogenases, '\~.-—— 


requires nicotinamide adenine dinucleotide 
(NAD) or its reduced form (NADH) (38-40). 
Rbf is a prototypical tumor suppressor of the 
retinoblastoma family (47) and a main tran- 
scriptional regulator of cell cycle genes as a 
part of the DREAM complex, where it counter- 
acts E2F by binding E2F’s transactivation do- 
main (42-44). Rbf2 is a Drosophila-specific 
Rbf paralogue that has specialized and over- 
lapping functions (34, 45). Sin3A is required 
for repression and activation of target genes 
of various functions and likely mediates re- 
pression through its HDAC-interaction domain 
(46, #7). Testing diverse CoRs should allow us 
to detect compatible and incompatible CoR- 
enhancer pairs. For each CoR, we performed 
two independent UAS-STARR-seq screens in 
which we cotransfected the UAS-STARR-seq 
library with a vector that expresses the Gal4- 


CoR [or Gal4-GFP (green fluorescent pro- ° 


tein) as neutral control; Fig. 1A] and spike-in 
controls for normalization. In all cases, the 
two independent replicates correlated well 
[Pearson correlation coefficient (PCC) > 0.84; 
fig. S1, A to F]. To reliably assess repression 
(which requires a high baseline activity), we 
evaluated enhancer-activity changes for 3094 
enhancers that were highly active in Gal4-GFP 
controls. 


CoRs CtBP and Rbf2 silence different subsets 
of enhancers 


We first tested CtBP and found that that some 
enhancers, such as the enhancers near Orct, 
Orct2, and Pka-C3, were reproducibly repressed 
by Gal4-CtBP, whereas others, such as the en- 
hancer near CG10516, were unaffected (Fig. 
1B). Comparing the two replicates per condi- 
tion using edgeR showed that CtBP signifi- 
cantly [false discovery rate (FDR) < 0.05, logs 
(fold change) (logsFC) < 0] repressed 759 en- 
hancers but not the remaining 2335 (Fig. 1C). 

Rbf2 was also able to repress only a subset 
of the enhancers (1733, 56%; Fig. 1D), includ- 
ing the enhancers near Orct and Pka-C3, which 
were also repressed by CtBP. In contrast to 
CtBP, Rbf2 could repress the enhancer near 
CG10516 but not the Orct2 enhancer, indicat- 
ing that Rbf2 and CtBP have different specific- 
ities (Fig. 1B). Indeed, whereas 502 enhancers 
were repressed by CtBP and Rbf2, 1231 en- 
hancers were repressed by Rbf2 but not by 
CtBP and 257 enhancers were repressed by CtBP 
but not by Rbf2 (Fig. 1E). 

The enhancer-CoR specificities were vali- 
dated in luciferase reporter assays that showed 
that an enhancer near serpent was repressed by 
Gal4-CtBP but not Gal4-Rbf2, an enhancer near 
CG2116 was repressed by Gal4-Rbf2 but not 
Gal4-CtBP, and an intronic enhancer of kay 
was repressed by both CoRs, each in agreement 
with the STARR-seq results (Fig. 1F). 
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Fig. 1. UAS-STARR-seq maps specificities between CoRs and active enhancers. 
(A) Schematic of the UAS-STARR-seq high-throughput enhancer-CoR assay. 
Gal4-GFP or a Gal4-CoR is recruited to the UAS sites near a genome-wide library 
of candidate enhancers. ORF, open reading frame; pAA, poly(A) site. 

(B) Screenshot from the University of California Santa Cruz (UCSC) browser 
visualizing the activity of nine enhancers (peaks) when GFP (green), the CoR 

CtBP (brown), and the CoR Rbf2 (red) are recruited. (© and D) MA plots of initial 
enhancer activity [logo(enhancer activity Gal4-GFP)] on the x axis and repression by 


CtBP (C) or Rbf2 (D) on the y axis [logo(enhancer activity Gal4-CoR over 
Gal4-GFP)]; n = 3094. (E) Scatterplot contrasting the sensitivity to repression 
(logoFC) of each enhancer (n = 3094) toward CtBP and Rbf2. Orange indicates 
enhancers sensitive to both CoRs (FDR < 0.05 and logsFC < 0), brown indicates 
enhancers sensitive to CtBP, and red indicates enhancers sensitive to Rbf2. 

(F) Luciferase (Luc) validations. For each plot, the tested enhancer is marked 
with a transparent red column and the normalized luciferase activity is displayed 
as horizontal bar plots; n = 2. 


Differential sensitivity to five CoRs define 
distinct types of enhancers 

Screening three additional CoRs—CoRest, Rbf, 
and Sin3A—revealed that each was able to re- 
press a specific subset of enhancers (Fig. 2A 
and fig. S1, G to I). The overall repression pro- 
files of CoRest and Sin3A were similar to that of 
CtBP (PCCs of 0.83 and 0.7, respectively) and 
clearly different from Rbf2 (PCCs of 0.28 and 


Jacobs et al., Science 381, 198-204: (2023) 14 July 2023 


0.24, respectively; fig. SIK); Rbfs repression pro- 
file was the most different from that of the other 
CoRs (PCC < 0.05). The differences between 
CtBP and Rbf repression persist when they are 
recruited upstream of the promoter or enhancer, 
despite changes in CoR efficacies between the 
positions (supplementary text 1 and fig. S2). 
For each CoR, we measured a wide range of 
repression strengths: Whereas the activity of 


some enhancers was reduced to background 
levels, other enhancers were only mildly af- 
fected (Fig. 1, C and D, and fig. S1, G to I). Both 
complete shutdown and mild (or soft) repression 
are likely biologically relevant (48). The CoR- 
enhancer sensitivities that we observed corre- 
late with where these CoRs operate in the 
genome, because chromatin immunoprecipita- 
tion sequencing (ChIP-seq) signals (34, 49-57) 
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Fig. 2. Enhancer clusters differ in chromatin marks, function, and TF binding. lower quartiles, and whiskers are minimum and maximum values. (D) Heatmaps 
(A) Screenshot from the UCSC browser displaying the effect of each CoR on visualizing endogenous ChIP-seq coverage over the enhancers in the clusters. Shown 


enhancer activity. (B) Heatmap visualizing all enhancers from the genome-wide 
screen, grouped into five clusters based on their similarity in repression profile. The 
logaFC in enhancer activity (CoR over GFP) is plotted, and repressed enhancers are 
marked in blue, unaffected in white, and activated in red. (C) Boxplots summarizing 
the effect that each CoR has on the activity of enhancers from the five clusters. 
For each boxplot, the center line represents the median, box limits are upper and 
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from left to right are STARR-seq activity under Gal4-GFP condition; endogenous 
chromatin marks H3K27ac, H3K4mel, and H3K4me3; and the TFs DREF, MIBP, 
and Trl. (E) Balloon-plot visualizing GO term enrichment over the five clusters 
(enhancer to nearest gene). Circle size indicates significance [-logioFDR], and 

the gene ratio is color coded. (F) Heatmap of significantly enriched or depleted TF 
motifs [logo(odds ratio motifs in selected cluster versus all other clusters)]. 
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Fig. 3. Specific TF motifs confer resistance against distinct repressors. (A) Screenshot from the UCSC 


browser displaying enhancer activity under GFP, CoRest, 


CtBP, and Rbf. DREF binding (ChIP-seq) is shown in 


black. (B) Heatmap of all enhancers ranked based on their sensitivity to CoRest. The ChIP-seq signal of DREF 


is displayed and quantified in boxplots (p < 2.2 x 107), and the occurrence of DREF motifs 


is plotted. (C) Boxplot 


showing the log2FC of enhancer activity when CoRest (left), CtBP (middle), or Rbf (right) is recruited over 


GFP. Each enhancer is present as original (magenta) an 
mutated; wt, wild type. (D) Boxplot showing the logsFC 
recruited over GFP. Each enhancer is present as origina 


id with its DREF motifs mutated (gray); n = 65. mut, 
of enhancer activity when Rbf (left) or CtBP (right) is 
(cyan) and with its ETS motifs mutated (gray); n = 157. 


(E) Boxplot showing the logoFC of enhancer activity when Rbf2 (left) or CtBP (right) is recruited over GFP. 


Each enhancer is present as original (orange) and with 
summarizing the effects of motif mutations on enhance 
wild-type enhancers over repression when specific motif 
the median, box limits are upper and lower quartiles, an 


tend to be enriched at enhancers that the re- 
spective CoR is able to repress (fig. S3, A to E). 
The UAS-STARR-seq results also agree with 
the differential repression of PCNA and CycB 
after overexpressing Rbf or Rbf2 in S2 cells 
(45) (fig. S3, F and G). 

We clustered the enhancers into five groups 
based on their sensitivity to the CoRs (og,FC 
against Gal4-GFP) using a self-organizing tree 
algorithm (SOTA) with the PCC as a distance 
metric (Fig. 2B). Cluster 1 contains CoRest- 
and CtBP-resistant enhancers, whereas en- 


its Tr 


motifs mutated (gray); n = 127. (F) Heatmap 
sensitivity to the different CoRs (log2FC of repression 
is mutated). For each boxplot, the center line represents 
d whiskers are minimum and maximum values. 


Sin3A. Cluster 3 contains enhancers that are 
resistant to Rbf2 and enhancers in cluster 4 
are sensitive to all CoRs, whereas enhancers in 
cluster 5 are overall sensitive to repression but 
resistant to Rbf (Fig. 2C). 


Enhancers clustered by CoR sensitivity differ in 
chromatin marks and function 


We investigated whether these enhancer clus- 
ters differ in additional properties that correlate 
with their behavior toward the CoRs. Initial 
enhancer activity, as measured by UAS-STARR- 


hancers in cluster 2 are resistant to CtBP and 
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ters (Fig. 2D and fig. S4A). Acetylated histone 
H3 lysine 27 (H3K27ac), a histone modification 
that marks active enhancers (52), was similarly 
enriched at the endogenous enhancer loci in 
all five clusters (Fig. 2D and fig. S41). This sug- 
gests that the distinct specificities did not 
stem from the differences in initial enhancer 
strengths. Methylated histone H3 lysine 4 
(H3K4mel1) and trimethylated histone H3 
lysine 4 (H3K4me3) levels did show differ- 
ential and complementary trends: H3K4mel1 
showed the highest enrichment in cluster 5, 
followed by 4 and 3, whereas H3K4me3 was 
more highly enriched in enhancer clusters 
1 and 2. This suggests that clusters 1 and 2 
are housekeeping-type enhancers (24) (Fig. 2D 
and fig. S4, B and C), whereas cluster 5 en- 
hancers are developmental or cell type-specific 
enhancers (53). Evaluating chromatin accessi- 
bility and H3K27ac levels in other cell types 
and tissues (31, 54, 55) confirmed that cluster | 
land 2 enhancers were more globally active, 
whereas cluster 5 enhancers were S2-cell spe- 
cific (fig. S4, H to N). The five clusters also 
differed in the functions of the enhancer-linked 
genes: Genes linked to Rbf-sensitive clusters 
1 and 2 are enriched for gene ontology (GO) 
terms like mitotic cell cycle or mRNA meta- 
bolic process, whereas genes linked to CtBP- 
sensitive cluster 5 are specifically enriched 
for development and morphogenesis GO 
terms (Fig. 2E). These results are consistent 
with the established functions of Rbf and 
CtBP in cell cycle regulation and development, 
respectively (56, 57). Moreover, cluster 5 en- 
hancers were the most strongly repressed by 
CoRest, CtBP, and Rbf2 (Fig. 2B), which sug- 
gests that developmental or cell type-specific 
enhancers might be more sensitive to repres- 
sion by these CoRs. 


TF binding and motifs differ between 
enhancer clusters 


Because active enhancers are known to func- 
tion through a variety of TFs, we hypothesized 
that their differential response to the CoRs 
might be linked to the specific TFs that they 
bind. Using published TF ChIP-seq data, we 
indeed observed that prominent TFs were 
bound to enhancers from specific clusters 
and absent from others. The TFs DREF (58) 
and M1BP (59), for example, bound almost ex- 
clusively to enhancers from clusters 1 and 2, 
with DREF preferring cluster 1 and MI1BP clus- 
ter 2 (Fig. 2D and fig. S4, D and E). Conversely, 
trithorax-like (60) (Trl, also called GAGA factor 
or GAF) was absent from these clusters and 
instead mainly bound to enhancers from clus- 
ter 3 and, to a lower extent, to cluster 4 (Fig. 
2D and fig. S4F). The distribution of these 
three TFs suggests that differential CoR sen- 
sitivities might relate to distinct TFs. 

To identify associations between CoR sensi- 
tivities and TFs in a more comprehensive 
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manner, we performed TF-motif enrichment 
analyses for the enhancers in the five clusters 
using 6502 TF motifs from the iRegulon data- 
base (67). Consistent with the ChIP-seq results, 
motifs for DREF, M1BP, and Trl were specif- 
ically enriched in clusters 1, 2 and 3, respec- 
tively (Fig. 2F). We found additional motifs 
that were also differentially enriched between 
the clusters (Fig. 2F). These data together sug- 
gest that the different enhancer clusters, de- 
fined by their sensitivity toward CoRs, are 
associated with distinct chromatin features, 
functions, and TF motif content. 


Identification of TF motifs that predict 
sensitivity and resistance to each CoR 


To directly test the association between TF 
motifs and sensitivity toward each CoR, we 
evaluated the enrichment of TF motifs within 
enhancers sensitive or resistant to each CoR 
(see supplementary text 2 and fig. S5 for a 
similar analysis of coactivator binding). Motif 
enrichment profiles across all five CoRs revealed 
an intricate relationship between enhancer- 
CoR sensitivity and TF motifs with highly 
distinctive enrichments (fig. S6, A to F). Con- 
sistent with the conserved role of Rb proteins 
as part of the DREAM complex (27, 42), E2F, 
DP, and Mip120 motifs were enriched in Rbf- 
sensitive enhancers. These three motifs, as 
well as DREF and M1BP motifs, were enriched 
in enhancers that were resistant to CoRest, 
CtBP, and Sin3A repression (fig. S6F). Con- 
versely, motifs for the developmental TFs GATA, 
API, and TEAD were enriched in enhancers 
sensitive to CoRest, CtBP, and Sin3A and re- 
sistant to Rbf and Rbf2. Furthermore, Trl and ETS 
motifs were specifically enriched in enhancers 
that were resistant to Rbf2 and Rbf, respec- 
tively, but sensitive to all other CoRs (fig. S6F). 
These results indicate that for each CoR, cer- 
tain TF motifs are specifically and strongly en- 
riched in sensitive and resistant enhancers. 

Given the differential motif enrichments, 
we next tested whether TF motif content is 
predictive of an enhancer’s CoR sensitivity. 
For this, we trained a generalized linear model 
(62) using TF motif counts as features and en- 
hancer sensitivity to a given CoR as the re- 
sponse variable. For each CoR, when using 
10-fold cross-validation, the models based on 
motif counts performed well and were able to 
predict an enhancer’s sensitivity and resist- 
ance to a given CoR (PCCs between 0.46 and 
0.73, p values of <2.2 x 107"; fig. S6G). Over- 
all, these results establish a strong association 
between CoR sensitivity and TF motif content 
that is predictive and might correspond to 
causal relationships. 


Distinct TF motifs are required for resistance to 
specific CoRs 


We noticed that DREF motifs were enriched 
in enhancers that were resistant to CoRest 
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and CtBP (Fig. 2F and fig. S6F). Enhancers 
bound by DREF, such as enhancers near the 
RYBP and Jupiter genes, are resistant to re- 
pression by CoRest and CtBP but sensitive 
to repression by Rbf (Fig. 3A). Ranking all 
enhancers based on their sensitivity to re- 
pression by CoRest or CtBP confirmed that 
resistant enhancers were significantly (p < 
2.2 x 107'°) enriched for DREF motifs and 
binding according to ChIP-seq (58) (Fig. 
3B). Given the correlation between CoRest 
and CtBP resistance and presence of DREF, 
we considered that DREF might protect en- 
hancers against CoRest- and CtBP-mediated 
repression. To test whether DREF motifs 
are required for resistance, we mutated the 
DREF motifs in 65 enhancers and found that 
the mutated enhancers were significantly more 
sensitive to repression by CoRest and CtBP (p = 
4.1 x 10" and 2.5 x 10°"°), whereas their sen- 
sitivity toward Rbf did not change (Fig. 3C 
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and fig. S7, A to C). We conclude that these 
enhancers require DREF motifs for resistance 
to CoRest and CtBP. 

Several other TF motifs were predicted to 
confer resistance to repression by specific CoRs. 
ETS-family motifs, for example, were specifi- 
cally enriched in enhancers that were resistant 
to Rbf repression (fig. S6F). We mutated the 
ETS motifs in 157 enhancers and observed in- 
creased sensitivity toward Rbf-mediated repres- 
sion (p = 1.49 x 10 ”; Fig. 3D and fig. S7F). The 
opposite trend was observed for the other 
CoRs, which were all able to repress wild-type 
enhancers containing ETS motifs better than 
their mutated counterparts, suggesting that 
ETS TFs are, in general, sensitive to repression 
(Fig. 3, D and F, and fig. S7, G and I). Similarly, 
Trl motifs and ChIP signals were enriched in 
enhancers that were specifically resistant to 
Rbf2 repression (fig. S8, A and B), and mutat- 
ing these motifs in 127 enhancers led to a small 
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Fig. 4. Effects of TF motif addition on the sensitivity of enhancers to repression. (A) Experimental setup of 
the motif paste experiments. Two TF motifs are pasted into the sequence of sensitive enhancers. The activity 

of original and motif paste enhancers is measured by UAS-STARR-seq under control (Gal4-GFP) and repressor 
(Gal4-CoR) conditions (data S2). cp, core promoter. (B) Boxplot showing the logoFC of enhancer activity when 


CoRest (left), CtBP (middle), or Rbf 


(right) is recruited over GFP. Each enhancer is present as original (black) 


and with two DREF motifs pasted (magenta); n = 110. (C) Boxplot showing the logsFC of enhancer activity when Rbf 
(left) or CtBP (right) is recruited over GFP. Each enhancer is present as original (black) and with two ETS motifs 
pasted (cyan); n = 107. (D) Boxplot showing the logsFC of enhancer activity when Rbf2 (left) or CtBP (right) is recruited 
over GFP. Each enhancer is present as original (black) and with the addition of two Trl motifs (orange); n = 93. 

(E) Heatmap summarizing the effects of motif additions on enhancer sensitivity to the different CoRs (logaFC of 
repression with motif addition over repression without motif addition). For each boxplot, the center line represents 
the median, box limits are upper and lower quartiles, and whiskers are minimum and maximum values. 
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but significant increase in sensitivity toward 
Rbf2, but not toward the other CoRs (Fig. 3, E 
and F, and fig. S8, C, D, and F). 


Specific TF motifs can turn sensitive enhancers 
into resistant ones 


Given that specific TF motifs are required for 
CoR resistance, we wondered whether these 
motifs are also sufficient to drive resistance. 
To test this, we introduced two “resistant” TF 
motifs [10 base pairs (bp) each] into CoR- 
sensitive enhancers at positions deemed un- 
important for enhancer activity (63) (see table 
$2) and evaluated the enhancers’ sensitivity to 
repression (Fig. 4A). As a control for the addi- 
tion of motifs, we also introduced two neutral 
TF motifs at the same positions (of the ACE2 
and FOX TFs), which were predicted to not 
affect the sensitivity to repression. 

Because DREF motifs were required for re- 
sistance to CoRest- and CtBP-mediated repres- 
sion (Fig. 3C), we tested whether they were 
sufficient to render CoRest- and CtBP-sensitive 
enhancers more resistant. Overall, the intro- 
duction of two DREF motifs was sufficient to 
protect the enhancers from CoRest- and CtBP- 
mediated silencing (p = 1.59 x 10°" and 3.71 x 
10°”, n = 110; Fig. 4B and fig. S9, A and B), and 
this increased protection was specific and due 
to the DREF motifs, because control motifs 
had no effect on sensitivity (fig. S9, E and F). 
Similarly, ETS motifs were necessary (Fig. 3D) 
and sufficient (Fig. 4C and fig. SIOA) to desen- 
sitize enhancers from repression by Rbf (p = 
1.06 x 10", n = 107), and Trl motifs were neces- 
sary (Fig. 3E) and sufficient (Fig. 4D and fig. SIOF) 
for Rbf2 resistance (p = 2.01 x 107°, n = 93). 

Whereas DREF and ETS motifs modulated 
the enhancers’ activities (figs. S7, D and H; 
S9D; and S10C), Trl motifs had only minor ef- 
fects on enhancer activity (Gal4-GFP; figs. SSE 
and S10H). Moreover, in each case, the gain 
and loss in resistance was CoR specific, be- 
cause these respective motifs had little effect 
on the repression by other CoRs or even made 
the enhancers more sensitive (Fig. 4E and figs. 
$7 to S10). The addition of ETS or Trl motifs, 
for example, made enhancers more sensitive 
to all other CoRs except for Rbf or Rbf2, re- 
spectively (Fig. 4E and fig. S10), whereas DREF 
motifs increased sensitivity toward Rbf (p = 
6.36 x 10“). This might either stem from the 
CoRs ability to directly antagonize the activity 
of these additionally bound TFs or because 
these TFs increase the overall CoR suscepti- 
bility of the enhancers. Hence, introducing 
specific TF motifs changed the enhancers’ 
sensitivity to repression by a given CoR along a 
spectrum of effect sizes in a specific and pre- 
dictable manner. 


Discussion 


In this work, we show that transcriptional en- 
hancers are differentially susceptible to active 
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Differentially repressed close-by enhancers 


A 
<= a mySe a 
cone ih 
Gal4-GFP 
Gal4-CoRest 
Gal4-Rbf ai A 
B 


Summary repressor enhancer specificities 


Kaye cee 
REREEEn fig kay mmf 33> 
Gal4-GFP 
Gal4-CtBP iD itis, 


Gal4-Rbf2 io A 


S41 2AISUaS 
resistant TFs 


©@@ 


enhancers 


S41 }Ue}SISOl 
sensitive TFs 


Fig. 5. Implications of repressor-activator specificity. (A) Example regions with two closely spaced 
but differentially repressed regulatory regions. Enhancers differentially repressed by CoRest and Rbf near the . 
fs(1)h and mys genes are shown on the left, and enhancers differentially repressed by CtBP and Rbf2 

near the fig and kay genes are shown on the right. (B) Summary schematics of the uncovered specificities 
between CoRs and activators, illustrating an additional layer in transcriptional regulation. Depending on 
motif content (magenta, cyan, or gray), enhancers can be sensitive to one CoR while resistant to another one, 
and different combinations lead to intermediate responses. 


repression by prominent CoRs. This additional 
layer of regulatory specificity enables the dif- 
ferential repression of closely spaced enhancers 
and genes. For example, CoRest could repress 
the intronic mys enhancer without affecting 
the neighboring fs(Dh gene, whereas Rbf could 
do the opposite (Fig. 5A, left). Similarly, CtBP 
could repress both enhancers in the kay/fig 
locus, whereas Rbf2 could repress the enhancer 
closest to fig but not the other (Fig. 5A, right). 

Although the effects are of varying size, the 
uncovered specificities between repressors and 
enhancers reveal a layer of “resistance” against 
repression (Fig. 5B). Specific motif combina- 
tions of activating and repressive TFs, together 
with the regulatory proteins present in each 
cell type, will largely determine when and 
where a regulatory region is active or repressed 
(6, 64). In addition, the ability of TFs to confer 
resistance also allows for enhancers and genes 
to be activated by derepression through TF 
recruitment, as we demonstrate by motif addi- 
tion (Fig. 4). This additional layer of regulation 
circumvents the requirement to remove the 
involved repressors, and the interplay between 


TFs and CoRs provides more flexibility to the 
regulatory system (Fig. 5B). 

That one mode of activation can be sensitive 
to one repressor but resistant to another im- 
plies that distinct activation and repression 
mechanisms must converge. Active repression 
might directly interfere with specific factors 
or defined steps of transcriptional activation. 
Certain CoRs might, for example, inactivate 
specific TFs or coactivators but not others [e.g., 
through posttranslational modifications (65, 66)] 
or counteract their downstream activating 
mechanisms. Alternatively, certain TFs might 
directly counteract a repressor’s function or 
bypass the rate-limiting step of initiation or 
elongation controlled by the CoR. Discerning 
the distinct mechanisms of activation and re- 
pression and how they intersect and coordi- 
nate will be of great future interest. 
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QUANTUM OPTICS 
Biasing the quantum vacuum to control macroscopic 
probability distributions 


Charles Roques-Carmes!*++, Yannick Salamin**+, Jamison Sloan’, Seou Choi‘, Gustavo Velez’, 
Ethan Koskas’, Nicholas Rivera*, Steven E. Kooi*, John D. Joannopoulos2*, Marin Soljaéic' 


Quantum field theory suggests that electromagnetic fields naturally fluctuate, and these fluctuations 
can be harnessed as a source of perfect randomness. Many potential applications of randomness 

rely on controllable probability distributions. We show that vacuum-level bias fields injected into 
multistable optical systems enable a controllable source of quantum randomness, and we demonstrated 
this concept in an optical parametric oscillator (OPO). By injecting bias pulses with less than one 
photon on average, we controlled the probabilities of the two possible OPO output states. The potential 
of our approach for sensing sub-photon-level fields was demonstrated by reconstructing the 

temporal shape of fields below the single-photon level. Our results provide a platform to study quantum 
dynamics in nonlinear driven-dissipative systems and point toward applications in probabilistic 


computing and weak field sensing. 


ne of the most fundamental features of 
quantum physics are vacuum fluctua- 
tions (J), which underlie many intriguing 
phenomena (2-8). Consequently, there 
has been interest in measuring vacuum 
fluctuations, either directly (9-1)), or indirectly 
(72, 13). Sensitive measurements of energy lev- 
els, forces, or fields are often required to discern 
the impact of vacuum fluctuations. However, 


Fig. 1. Tuning macroscopic probability distri- 
butions by biasing vacuum fluctuations. 

(A and C) Fluctuation snapshot, Hamiltonian, 
and associated probabilities for an (A) unbiased 
and (C) biased bistable system. In (C), a 
vacuum-level bias field is injected in the cavity. 
(B) Schematic of quantum optical multistable 
system. (D) Stochastic amplitude trajectories for 
a biased optical parametric oscillator (OPO). One 
hundred trajectories are plotted in each graph, 
and time is normalized to the cavity lifetime 

tcav. (E) Probability of state a as a function 

of bias intensity b. 
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certain nonlinear systems are so sensitive to 
initial conditions that vacuum fluctuations crit- 
ically influence the outcome of classical ob- 
servables. Famously, multistable systems that 
spontaneously break continuous or discrete 
symmetries reveal the statistical nature of the 
vacuum (J4), in which stochastic vacuum 
forces randomly pick one of the steady states; 
this makes vacuum fluctuations a source of 


q 


randomness (15). In optics, this phenome Chec 
occurs at the thresholds of lasers and op 
parametric oscillators (OPOs) (J6, 17). As a 
result, these optical platforms have been thor- 
oughly explored as generators of true and 
ultrahigh-rate random numbers (18-23). 
These quantum sources of randomness are 
also promising for many applications. For ex- 
ample, stochastic components are essential 
across a range of computing tasks (24-29). Prob- 
abilistic computing (27) in particular has shown 
promise for speeding up optimization and in- 
ference tasks (30-33). Its central building block 
is a probabilistic bit (p-bit): a stochastic logical 
unit described by a controllable probability 
distribution (for example, producing 1 with a 
probability p, and 0 with probability 1 - p) (30). 
The prospect of leveraging vacuum fluctuations 
as a source of controllable randomness for pho- 
tonic probabilistic computing is attractive, 
given the recent demonstration of ultrahigh- 
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Fig. 2. Experimental demonstration of a photonic p-bit in a biased OPO. 
(A) Schematic of the experimental setup. AM, amplitude modulator; BS, beam 
splitter; PBS, polarization beam splitter; dB, bias attenuation (combination of 
neutral density filters, pinhole, and polarization optics); 2/2, half waveplate; 

, spherical mirror; PPLN, 


DM, dichroic mirror; PD, photodiode; FM, flat mirror; S 


bandwidth random-number generators (23) 
and their potential integration in low-latency 
and power-efficient photonic processors (34). 

A substantial gap still separates the current 
state of the art from the proposed controlled 
quantum randomness in optics. Specifically, 
all existing optical random-bit generators rely 
on perfect symmetry, so that vacuum fluctua- 
tions yield unbiased outcomes. The other ex- 
treme for nonlinear optical systems lies in 
externally seeded devices (35-37), in which the 
steady state is completely determined, sac- 
rificing any notion of randomness. The exis- 
tence of these two extremes—perfect unbiased 
randomness and complete determinism— 
raises questions about what physics may lie 
in between. 

In this work, we show that the injection of 
vacuum-level fields into a multistable optical 
system enables a controllable source of biased 
quantum randomness. We show that quantum 
biased randomness can be realized in various 
nonlinear photonic systems, and we demon- 
strate our concept in a biased OPO, where the 
random variable is the phase of the signal 
field, which can take on values 0 and x. We 
generated a photonic p-bit, with a parameter 
p that can be controlled by the bias amplitude 
and phase. We then elucidated the physical 
origin of this biased symmetry breaking by 
showing that this biased randomness arises 
directly from interference between the bias 
field and vacuum fluctuations. Last, we show 
that vacuum-level biased optical systems can 
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probe vacuum field fluctuations, as well as 
the temporal profile of very weak electromag- 
netic pulses, down to much less than one (~10™) 
photon per pulse. 


Controlling macroscopic observables with 
coherent injection of vacuum-level fields in 
quantum optical multistable systems 


We considered a physical system with Hamil- 
tonian Ho, which possesses multiple degen- 
erate stable steady states (multistable). For 
concreteness, we considered a bistable sys- 
tem described by two degenerate ground states 
o® (é € {0, 3} (Fig. 1A). The system is initiated 
in the unstable state that lies at the local 
maximum of the potential between the two 
degenerate states. This unstable initial state 
will then decay into one of the degenerate en- 
ergy minima with equal probability (Fig. 1A, 
center). The resulting random outcome stems 
from random field fluctuations (Fig. 1A, left) 
in the initial state. Nonlinear quantum optical 
multistable systems provide a natural platform 
to implement Ho, and macroscopic probabil- 
ity distributions can be extracted by repeated 
phase measurements (Fig. 1B). 

We focused on one possible implementa- 
tion of Fig. 1B: a bistable optical system based 
on a degenerate biased OPO. The essential 
component of a degenerate OPO is a second- 
order nonlinear crystal in an optical cavity. 
When a pump with frequency 2m is coupled 
into the crystal, oscillations at the signal fre- 
quency @ are induced through parametric am- 
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Bit number 


periodically poled lithium niobate nonlinear crystal; LO, local oscillator; 

F, spectral filter; and PZT, piezoelectric actuator. (B) Probability measurement 
for various maximum bias powers P; and varying waveplate angle 0, over 
20,000 samples. (Inset) Schematic of the bias control setup in this experiment. 
(C) Bit stream portions at 6 = 45° 


plification. The resulting steady-state signal 
field can take on two distinct phases, 0 and x— 
respectively, a and o®. In the absence of any 
injected bias field at the signal frequency, the 
OPO steady state above threshold approaches 
a mixed state of the two possible phases, with 
equal likelihood (20). 

The situation changes when a bias field is 
injected into the cavity (Fig. 1C). Although add- 
ing a coherent field is known to break the sym- 
metry of such systems, we specifically considered 
vacuum-level bias fields, which are on the order 
of the fields associated with vacuum fluctua- 
tions. By tuning the injected bias field, a mixed- 
state density matrix is pias oT the form 
p = (1—p)la )(a | + pla )(at i) realiz- 
ing a binomial distribution G,: Pra y=1-p 
and P(a) = p, with p € [0, I]. The probability 
p is given by the analytical expression 


wife) +] 0 


In this expression, erf is the error function; 0 is 
the bias field, which is defined so that b” is 
the number of bias photons present in the 
signal pulse in the cavity during each round 
trip; and 6 the phase of the bias field relative 
to the OPO signal. Furthermore, bg is the crit- 
ical bias field level required to modify the prob- 
ability, which is of order unity, corresponding 
to single-photon-level bias. Amplitude tra- 
jectories for an OPO with weak gain and var- 
ious levels of bias power are shown in Fig. 1D. 
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Fig. 3. Biased coin flips A 
from the quantum vacuum. 
(A) Stochastic trajectories 
of OPO amplitude. (Inset) 

Zoom-in of the trajectories 


at early times. (B) Four-step 
model evolution of the H 


probability distribution. 


(C) Statistical model of the 0 
biased OPO as a biased 
coin flip, where the 
parameter p is determined 


Bias + gain 


1 
by the positive-value area ‘ 
under the ground-state F 
wave function density } 
Wo(E)|° shifted by Epeak. q 
(D) Reconstructed wave ; 
function density |%o(E)|? i 
of the electromagnetic i 
ground state. (E) Transition {| a 

1 
1 
1 
1 
1 
1 


between purely random 
and deterministic regimes 
of the OPO. The gray line is 
the theoretical prediction 
without any fitting parame- 
ters. For readability, only 
every other angle data 
point with probability 

p > 0.5 is shown. 


-2000 0 


E_ (Vim) 


vac 


The derivation of the bias-probability rela- 
tionship from a rigorous quantum mechan- 
ical model is given in the supplementary 
text, section B (38). The theory consists of 
writing the Heisenberg-Langevin equation 
of motion for the weakly biased OPO system 
and solving an equivalent set of classical 
stochastic differential equations. A general- 
ization of this concept to multistable sys- 
tems (such as lasers) and considerations on 
the implementation of p-bits in ultrafast op- 
tics are presented in the supplementary text, 
section A (38). 


Experimental demonstration of a photonic 
p-bit in a biased degenerate optical 
parametric oscillator 


We then demonstrated a vacuum-level biased 
OPO (Fig. 2A) that consists of three main parts: 
(i) a degenerate OPO in a bow-tie cavity (Fig. 
2A, blue shaded area); (ii) OPO phase interfer- 
ometric measurement (Fig. 2A, orange shaded 
area); and (iii) temporal and amplitude con- 
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trol of the bias field (Fig. 2A, green shaded 
area). A complete description of the experimen- 
tal setup is given in (38) and the supplementary 
text, section C. 

The measured probability p is shown in 
Fig. 2B, where the bias field amplitude sets 
the p-bit probability. The bias field amplitude 
was set by a fixed attenuation (corresponding 
to the maximum power P;) and fine tuned by 
rotating the half waveplate angle, so that the 
bias power is given by P;sin?(20). We fitted 
our experimental data to the functional form 
from Eq. 1, showing excellent agreement with 
our model. The bit probabilities change in 
response to bias field attenuation as expected 
from theory. 

By changing the attenuation level of the 
bias field, we were able to traverse the con- 
tinuous space between perfect randomness 
and determinism. Segments of the bit stream 
statistics (Fig. 2C) show a transition from a 
slightly skewed binomial distribution B, with 
p ~ 0.5 (Po) to purely deterministic outcomes 
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(Ps). The realized p-bit exhibits the main prop- 
erties necessary for its use in probabilistic 
computing platforms (27): statistical indepen- 
dence of consecutive samples and continu- 
ous tunability of the probability distribution 
(supplementary text, section A). Because the 
p-bit outcome is in the optical domain, it could 
be further processed by an optical analog pro- 
cessor (34). 


Sensing vacuum fluctuations and 
sub-photon-level fields with photonic p-bits 


We next demonstrated that the observed prob- 
ability control is the result of bias fields at 
the level of vacuum fluctuations. Shown in 
Fig. 3A are sample trajectories simulated by 
using stochastic differential equations de- 
rived from a rigorous quantum mechanical 
model of a biased OPO. Examining the prob- 
ability distribution of fields at four character- 
istic times (marked I to IV) during evolution _ 
from the initial condition allowed us to de- 
velop a four-step model of how the biased 
distribution develops, from the initial vacuum 
fluctuations (I), undergoing bias and gain (II), 
then bifurcating (IID, before reaching steady 
state (IV). 

Our experimental data are quantitatively 
consistent with these theoretical predictions. 
The experiments shown in Fig. 2B are realized 
with ~3 x 10° to 4: photons per pulse (depend- 
ing on the total attenuation level). In Fig. 3E, 
we plotted the measured probability from var- 
ious bias power levels P; against the estimated 
number of photons per pulse, matching our 
theory without any fitting parameter. Our 
theory accurately predicted the absolute level 
of bias fields required to influence the proba- 
bility in our experiments (on the order of one 
photon per pulse). 

Equivalently, the critical bias peak field is 
comparable with the amplitude of the vacuum 
fluctuations AE, of the bias mode. As shown 
in stage II of Fig. 3B, early stages of the OPO 
evolution play a critical role in picking the 
steady state of weakly biased OPOs. This ob- 
servation allowed us to extrapolate a simple 
statistical model of the phase measurement 
process in a biased OPO (supplementary text, 
section D), where the probability parameter p 
is given by the area under the positive-field 
side of the ground-state wave function density 
|Wo(Evac)|? displaced by Eyeax (Fig. 3C). 

This simple model, which is in agreement 
with our theory, also confirms the origin of the 
randomness in our experiment and all other 
OPO-based random-number generators: quan- 
tum statistical fluctuations of the fields in the 
early stages of field amplification. In low-gain 
regimes characteristic of most experimental 
realizations, including ours, the probability 
distribution is approximately independent of 
the gain (Eq. 1 and supplementary text, section 
B). This allowed us to map the ground-state 
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Fig. 4. Measuring sub- A 
photon-level ultrashort 
pulses. (A) Schematic of the 
bias time-delay control in 
this experiment. (B) Field 
reconstruction method. 

(C) Measured probability as 
a function of the time delay 
Atpias: (D) Corresponding 
reconstructed bias field. 

(E) Extracted field envelope 
and corresponding sech fit. 
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(F) Central part of the pulse es 
and corresponding fit. ox 
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wave function density |'¥o (Eyac) ? to the bias 
sensitivity of the probability dp/db (Fig. 3D). 
The reconstructed |'Vo (Evac) ie closely matches 
the expected distribution from quantum op- 
tics |‘o(Evac)|” exp |—E?/ (2AE2,..) |. This re- 
sult is another strong indication that biased 
vacuum fluctuations are at the origin of the 
randomness in our experiment. 

The probability’s sensitivity to the phase 
bias can be directly used to sense the temporal 
dependence of sub-photon-level fields (fields 
for which the mean number of photons in the 
pulse is less than one). In the schematic of our 
field-sensing experiment (Fig. 4A), the bias 
field was time-delayed with respect to the 
pump. At each Atpjas, we recorded statistics 
of the OPO phase (Fig. 4B). By inverting the 
bias-probability relationship, we could esti- 
mate the time-dependent bias field: b(Atpias ) 
erf '[2p(Atpias) — 1], where erf ' is the inverse 
of the erf function. 

The reconstructed field b(Atpias) (Fig. 4D) 
reproduces the main characteristics of the 
bias’ temporal line shape. From the field line 
shape, we could extract the pulse width and 
center wavelength (data and fits are shown in 
Fig. 4, E and F). Our analysis yielded estimated 
values for the main lobe of ~220 + 6 fs and 
~1520 + 0.2 nm, respectively, which is in agree- 
ment with nominal values of the laser we used 
for this experiment. Specifically, the bias en- 
velope is ~30 fs broader than the theoretical 
value, which can be attributed to a ~100-fs OPO 
signal at early times, which is consistent with 
the pump temporal width. We show the in- 
fluence of OPO signal bandwidth and center 
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frequency on our reconstruction method in 
the supplementary text, section C, and we dis- 
cuss further justification of this reconstruction 
method in section B. Although we demonstrated 
bias and vacuum field reconstruction in this 
work, our method could in principle be used 
to sense the OPO signal by measuring the 
influence of a known bias field on the phase 
statistics. We expect this method to prove par- 
ticularly useful in measuring transient quan- 
tum states of light that can be generated in 
OPO cavities (39). 


Discussion and conclusions 


Taken together, the methods proposed in this 
work suggest the use of vacuum-level bias 
fields as a probe of transient dynamics in 
nonlinear driven-dissipative systems with 
potential applications in sensing and comput- 
ing. Our sensing method could be extended 
to observe the field dynamics in the cavity 
(Fig. 3D) by appropriately sweeping over the 
bias time delay and amplitude. Conversely, 
quantum states of light for biasing offers 
another degree of freedom to shape the bias- 
probability relationship. 

An anticipated extension of our work will be 
to incorporate this photonic p-bit into a com- 
puting platform to realize photonic proba- 
bilistic computing (27). This can be realized 
by leveraging the extensive body of work on 
photonic Ising machines (40, 41). The main 
speed limitations of our p-bit come from the 
control electronics and cavity mode lifetime 
(20). To increase the repetition rate of our 
photonic p-bit, one could resort to integrated 
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photonics (27) or multimode laser cavities (23). 
Realizing our approach in multistable systems 
[for example, in integrated arrays of nano- 
lasers (42)] paves the way to p-bits with extreme * 


bandwidth, emulation of complex many-body 


Hamiltonians (by engineering couplings be- 


tween p-bits), and nontrivial p-bit topologies 


(by using Hamiltonians Hp with higher-order 
symmetries). 
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Engineering ligand reactivity enables high-temperature 
operation of stable perovskite solar cells 


So Min Park'}, Mingyang Wei*+, Jian Xu'}, Harindi R. Atapattu’, Felix T. Eickemeyer, 

Kasra Darabi’*, Luke Grater’, Yi Yang®, Cheng Liu®, Sam Teale’, Bin Chen, Hao Chen’, 
Tonghui Wang“, Lewei Zeng’, Aidan Maxwell’, Zaiwei Wang’, Keerthan R. Rao*, Zhuoyun Cai°, 
Shaik M. Zakeeruddin’, Jonathan T. Pham®, Chad M. Risko*, Aram Amassian‘*, 

Mercouri G. Kanatzidis®, Kenneth R. Graham**, Michael Gratzel**, Edward H. Sargent'>”* 


Perovskite solar cells (PSCs) consisting of interfacial two- and three-dimensional heterostructures 
that incorporate ammonium ligand intercalation have enabled rapid progress toward the goal of uniting 
performance with stability. However, as the field continues to seek ever-higher durability, additional 
tools that avoid progressive ligand intercalation are needed to minimize degradation at high 
temperatures. We used ammonium ligands that are nonreactive with the bulk of perovskites and 
investigated a library that varies ligand molecular structure systematically. We found that fluorinated 
aniliniums offer interfacial passivation and simultaneously minimize reactivity with perovskites. Using 
this approach, we report a certified quasi-steady-state power-conversion efficiency of 24.09% for 
inverted-structure PSCs. In an encapsulated device operating at 85°C and 50% relative humidity, we 
document a 1560-hour Tg5 at maximum power point under 1-sun illumination. 


etal-halide perovskite solar cells (PSCs) 

are emerging photovoltaic (PV) tech- 

nologies that hold promise in terawatt- 

scale deployment (7). They unite high 

power-conversion efficiencies (certi- 
fied PCEs up to 25.7%) (2) with low-cost solu- 
tion processing that uses abundant materials 
(3-5). To compete with crystalline silicon (c-Si) 
solar cells and to be applied with c-Si in tan- 
dem cells, PSCs will need improved evidence 
of bankability (6), such as operating stabil- 
ity at elevated temperatures in accelerated 
aging (7-12). 

Interfaces, which have high defect densities 
(13, 14) and low barriers to ion transfer (15, 16) 
and are susceptible to moisture- and oxygen- 
induced degradation (/7, 18), can provide path- 
ways for energy loss and degradation in PSCs 
(19, 20). In record-efficiency PSCs, interfaces 
have been passivated by using two- and three- 
dimensional (2D/3D) hybrid structures (27-23), 
where a thin layer of Ruddlesden-Popper 2D 
perovskites terminates the 3D-perovskite sur- 
face. These 2D/3D structures are constructed 
by exposing perovskite surfaces to a solution 


Department of Electrical and Computer Engineering, University 
of Toronto, 10 King’s College Road, Toronto, Ontario MSS 364, 
Canada. “Laboratory of Photonics and Interfaces, Ecole 
Polytechnique Fédérale de Lausanne, 1015 Lausanne, 
Switzerland. *Department of Chemistry, University of 
Kentucky, Lexington, KY 40506, USA. “Department of 
Materials Science and Engineering, and Organic and Carbon 
Electronics Laboratories (ORaCEL), North Carolina State 
University, Raleigh, NC 27695, USA. °Department of 
Chemistry, Northwestern University, Evanston, IL 60208, 
USA. °Department of Chemical and Materials Engineering, 
University of Kentucky, Lexington, KY 40506, USA. 
Department of Electrical and Computer Engineering, 
Northwestern University, Evanston, IL 60208, USA. 

tThese authors contributed equally to this work. 

*Corresponding author. Email: ted.sargent@utoronto.ca (E.H.S.); 
michael.graetzel@epfl.ch (M.G.); kenneth.graham@uky.edu (K.R.G.) 


Park et al., Science 381, 209-215 (2023) 14 July 2023 


containing ammonium ligands, during which 
the 3D lattice is fragmented into 2D layers 
(24, 25). This approach is beneficial because 
the 2D overlayer increases the resistance of 
perovskites to degradation (17, 26). 

There is a diversity of findings in reports 
on the stability of 2D overlayers under stress. 
Some studies have indicated limited stabil- 
ity of the 2D/3D interface under thermal 
stress (27-29). Progress has been made, with 
a recent study demonstrating 500-hour op- 
erating stability under conditions of 65°C, 
50% relative humidity (RH), and maximum 
power point (MPP) tracking [International 
Summit on Organic Photovoltaic Stability 
(ISOS)-L-3-65°C], enhanced by the incorpora- 
tion of 3-fluorophenethylammonium (3FPEA) 
intercalation (23). Another study (30) that 
used an oleylammonium-intercalated 2D over- 
layer showed promising results in a 1000-hour 
damp-heat test, although this study was based 
on silicon PV module IEC 61215:2016 stan- 
dards (85°C and 85% RH, dark). 

Given these promising performance and 
stability improvements, the field continues to 
pursue ever-higher durability targets. There is 
interest in raising the standard to 85°C ISOS- 
L-3 (MPP tracking at 85°C and 50% RH). How- 
ever, the reactivity of ammonium ligands with 
3D perovskites may lead to further penetra- 
tion into the bulk perovskite film (28, 31, 32) 
and may contribute to deterioration in device 
performance under these very demanding stress 
conditions (27, 28, 31-33). Thus, we explored 
noninvasive surface-passivating ligands, but 
we did note that in prior studies, these have 
yet to achieve the remarkable benefits of the 
2D/3D strategy. 

We sought to understand how molecular 
structure affects ligand reactivity with 3D 


q 


perovskites. Previous studies used bul ree 


ammonium ligands to suppress ligand it... 
calation, whereas certain small-sized ammo- 
nium ligands exhibit low reactivity (34, 35). We 
attribute this difference to the limited availa- 
bility of characterization techniques that allow 
one to compare ligand reactivity among dif- 
ferent ammonium molecules—a particularly 
challenging pursuit in view of the ultrathin 
nature (typically <10 nm) of the passivating 
overlayer (23). 


Spectroscopy studies of ammonium 
ligands in perovskites 


We used angle-resolved x-ray photoelectron 
spectroscopy (AR-XPS) and time-of-flight sec- 
ondary ion mass spectrometry (TOF-SIMS) 
to investigate the penetration of a library of 
ammonium ligands into the bulk of perov- 
skites (Fig. 1A), which include varying tail 
groups and alkyl chain lengths. A small-sized, 
ammonium ligand, anilinium (An), had the 
lowest reactivity with 3D perovskites. We then 
explored derivatives of An with different de- 
grees of fluorination to couple low ligand 
reactivity with effectual interfacial passiva- 
tion. These molecules improved operating 
stability for the encapsulated PSC at 85°C ‘ 
and 50% RH. 

Ammonium ligands were deposited on 
CS9.05MAo.05FAo.9Pb(Io.95Bro.05)3 perovskite 
films (materials and methods), and films + 
were annealed at 100°C (37). We used AR-XPS 
to probe the spatial distribution of ammo- 
nium ligands in the out-of-plane direction (Fig. 
1B). AR-XPS is a nondestructive depth-profiling 
technique used to determine the composition 
of ultrathin layers (<10 nm) at the top sur- 
faces of films (36, 37). We varied the probe 
depth by changing the angle between the nor- 
mal of the perovskite sample and the ana- 
lyzer. Three different electron takeoff angles < 
(0°, 45°, and 75°) were selected, with probe ‘ 
depths varying from ~6 to 8 nm at 0° to ~1 
to 2 nm at 75° (Fig. 1C). 

To obtain the proportion of ammonium li- ,. 
gands at a given probe depth, we compared the 
ratio of C-N, where N signals originated from 
methylammonium (MA) and ammonium li- 
gands at a binding energy of ~401.5 eV, with the 
N signals originating from formamidinium 
(FA) at a binding energy of ~399.8 eV in the 
peak area in the N Is XPS spectrum (figs. S1 
and S2). The C-N/FA N ratio plots (Fig. 1D) 
revealed that the aryl ammoniums phenethyl- 
ammonium (PEA) and 3FPEA had relatively 
uniform depth distributions. By contrast, alkyl 
ammoniums butylammonium (BA) and octyl- 
ammonium (OA) tended to accumulate on the 
top surfaces, as evidenced by the C-N/FA N 
ratio increasing with larger electron takeoff 
angles. However, each of the ligands yielded 
C-N/FA N ratios greater than those of un- 
treated perovskites (control) for all electron 
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takeoff angles (Fig. 1E), indicating their dis- 
tinct contributions to C-N signals and their 
intercalation into the bulk of perovskites. 

We then examined long-chain alkyl ammo- 
nium decylammonium (DA), along with small- 
sized An (34, 35, 38). For DA, the C-N/FA N ratio 
decreased with a larger electron takeoff angle 
(Fig. 1D). The reversed trend, as compared 
with other alkyl ammoniums, suggested that 
DA molecules diffused into the bulk of perov- 
skites. By contrast, An exhibited low reactivity 
and had limited penetration into perovskites. 
The C-N/FA N ratios for An were a factor of 
~2 less than those for PEA at different electron 
takeoff angles and are similar to control perov- 
skites (Fig. 1E). 

At these low C-N/FA N ratios, quantifying 
the extent of ligand penetration was difficult, 
given the prominent C-N contribution from 
MA cations (fig. S2 and table S1). We thereby 
performed TOF-SIMS and found that, whereas 
PEA cations were distributed throughout the 
thickness of the perovskite film, An cations 
were detected only at the top surface of the 
film (fig. $3). The combination of AR-XPS and 
TOF-SIMS supported An remaining surface 
localized. 


Fig. 1. AR-XPS characterization of ammonium 
ligand penetration. (A) Chemical structures 

of the ammonium ligands investigated in this 
study. (B) Experimental setup for AR-XPS charac- 
terization. The electron takeoff angle o is defined 
as the angle between the normal of the sample 
substrate and the analyzer. (©) Schematic indi- 
cating the relation between o and the photo- 
electron probing depth, d. (D and E) C-N to 

FA N ratios at electron takeoff angles of 0° 45° 
and 75° for the control (untreated) and ammonium 
ligand-treated perovskite films. The results 
show reduced bulk and surface existence for An 
and An derivatives. 
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Although An exhibited low ligand reactivity, 
it has also been reported to have limited pas- 
sivating capacity (35, 39). To promote interfacial 
passivation, we explored fluorinated An, including 
4-fluoroanilinium (4FAn), 2,6-difluoroanilinium 
(26FAn), and 3,4,5-trifluoroanilinium (345FAn), 
as well as tert-butyl-substituted 3,5-di-tert- 
butylanilinium (35tbuAn) (/7, 40, 41). The C-N/ 
FA N ratios (Fig. 1E) revealed that, compared 
with An, the newly synthesized ligands re- 
tained similarly low reactivity with perovskites. 
For fluorinated An, this low reactivity was also 
seen in the C-F signal in the F 1s XPS spectrum 
(fig. S4). We identified a distinct C-F peak for 
3FPEA, whereas the C-F signals from 4FAn, 
26FAn, and 345FAn were all below the detec- 
tion limit at the different electron takeoff angles. 
TOF-SIMS further confirmed that 345FAn 
cations were localized to the surface of the 
perovskite film (fig. S5). 

We then characterized the phase transfor- 
mation of perovskites driven by their inter- 
action with ammonium ligands. Ultrafast 
transient reflection (TR) spectroscopy was 
used initially to detect 2D perovskite forma- 
tion (42). For the PEA-treated perovskite films, 
negative reflectance features associated with 


n = 1, 2, and 3 layered 2D perovskites were 
observed in the TR spectra (Fig. 2A). The 
relative proportion of 2D phases increased as 
the solution exposure time was prolonged (fig. 
S6) and was indicative of progressive phase 
transformation. For the An- and 345FAn-treated 
perovskite films, no 2D phases were observed 
(Fig. 2, B and C), which we attributed to their 
inability to convert 3D perovskites into 2D 
phases (fig. S7). These results reinforce the model 
proposing that inhibiting 2D phase conversion 
would also reduce ligand penetration. 

We also investigated phase transformation 
with grazing incidence wide-angle x-ray scat- 
tering (GIWAXS) measurements. Consistent 
with the TR studies, n = 2 2D perovskites were 
observed at around q=0.29A”™ along q, for 
the PEA-treated perovskite film (Fig. 2D) that 
blocks vertical carrier transport (32). Inci- 
dence angle-dependent diffraction patterns 
of PEA-treated perovskites revealed that the 


2D phase formation was more pronounced 


near the surface (fig. S8). No 2D phase was 
detected in the An- and 345FAn-treated perov- 
skites (Fig. 2, E and F). 

In addition, we observed the formation of PbI, 
at q=0.9 A” for An-treated perovskite films, 
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which we attributed to polar solvent-induced 
decomposition during solution exposure, as 
seen in x-ray diffraction (XRD) characteriza- 
tion (fig. S9). This is a topic that requires further 
in-depth investigation. GIWAXS characteri- 
zation showed that 345FAn protected the 
underlying perovskites from this solvent- 
induced degradation (Fig. 2F). We associate 
the reduction in surface degradation with in- 
creased hydrophobicity of this fluorinated 
ligand (fig. S10) (26). 


Theoretical calculations 


The interactions of ammonium ligands and 
perovskites are depicted in Fig. 3A. We per- 
formed density functional theory (DFT) cal- 
culations to explore how molecular structure 
affected these interactions. We first calcu- 
lated the binding energies (£,) of two adja- 
cent perovskite fragments at their interface 
with the insertion of ammonium ligands (Fig. 
3B). A more negative £, implies that it is 
thermodynamically more favorable to form 
the 2D/3D heterostructures (24). Consistent 
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with the AR-XPS studies, penetrating PEA had 
a more negative EF, value than alkyl ammo- 
niums OA and BA, whereas An and its deriv- 
atives showed the least tendency to form 
2D/3D structures (Fig. 3C). 

It has been suggested that the formation of 
2D perovskites is related to the steric hindrance 
around the NH," group (43). We calculated the 
steric effect index (STEI, defined as the steric 
environment around the NH;* group) of the 
ammonium ligand (43) and found that An had 
a much larger STEI (2.34) than that of PEA 
(1.21). Furthermore, phase transformation en- 
tailed the replacement of A-site cations at 
2D/3D interfaces (24). DFT calculations indi- 
cated that this process demanded more energy 
for An-intercalated interfaces than for PEA- 
intercalated ones (fig. S11). This evidence fur- 
ther supported our finding that An was less 
likely to penetrate the bulk of perovskites 
through ligand intercalation. 

We proceeded to examine the interactions 
between ammonium ligands and perovskite 
surfaces. We calculated the interaction energy 
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(Ein) for the scenario in which an ammonium 
ligand occupied an MA-vacancy site on the 
perovskite surface (Fig. 3D). Electrostatic po- 
tential calculations showed that fluorine sub- 
stitution resulted in a higher positive charge 
density near the ammonium group (fig. S12) 
that could enhance its binding with the neg- 
atively charged MA vacancy (13). Indeed, the 
Ejnt Values were —0.74, —0.89, —0.92, and —1.22 eV 
for An, 4FAn, 26FAn, 345FAn, respectively 
(Fig. 3E), indicating that 345FAn exhibited 
the strongest interaction with the perovskite 
surface. Future computational studies could 
beneficially predict the binding energy of 
the N 1s XPS peak. 


Perovskite degradation and solar cell studies 


We used TR and TOF-SIMS to study perovskite 
degradation. The TR results indicated that PEA- 
based 2D perovskites were thermally unstable 
on the film surface and decomposed into PbI, . 
after thermal aging at 85°C for 2 hours (Fig. 4, 
A and B). This process we associated with the 
diffusion of PEA cations into bulk perovskites, 
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Fig. 2. Phase transformation of perovskite films. (A to C) Pseudocolor representation of the transient reflectance spectra, AR/R, for the PEA- (A), An- (B), 

and 345Fan-treated (C) perovskite films, respectively. n = 1, n = 2, n = 3 2D perovskites (nm, M2, and n3) appear in the film after PEA passivation. (D to F) GIWAXS 
image for perovskite films of PEA (D), An (E), and 345FAn (F) passivation, respectively. The color bar shows the diffraction intensity collected from the GIWAXS 
detector. gy, and qg, represent in-plane and near out-of-plane scattering vectors, respectively. n = 2 2D perovskite (nz) and Pbl2-related diffraction peaks are observed 
in the PEA- and An-treated films, respectively. (100) refers to the (100) diffraction peak of 3D perovskites. 
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Fig. 3. DFT studies. A 
(A) Schematic depiction 

of the model of ammonium 
ligand and perovskite 
interactions. (B) Models 
used in DFT calculations 
to calculate the binding 
energy (Ep) of adjacent 2D 
perovskite fragments. 

(C) £, values for seven 
ammonium ligands sub- 
stituted into MAPbl3 perov- 
skite slabs. (D) Models 
used to calculate the 
interaction energy (Eint) of 
ammonium ligands with 
the MAPbl3 perovskite 
surface. (E) Eint values of 
four ammonium ligands with 
the perovskite 

surface. 


3D phase 


as was seen with TOF-SIMS (Fig. 4C) and AR- 
XPS (fig. S13). The dynamic nature of 2D/3D 
heterostructures was also confirmed for BA, OA, 
DA, and 3FPEA (fig. S14). By contrast, neither 
ligand penetration nor phase degradation was 
observed for An- and 345FAn-treated films 
(Fig. 4C and fig. S15). These results suggested 
that suppressing ligand intercalation into 3D 
perovskites improved interface stability under 
thermal stress. 

We monitored the photoluminescence (PL) 
stability of perovskite films after annealing 
at 85°C (Fig. 4D). The emission intensity of 
the untreated control and PEA-treated films 
degraded to 30% of its initial value within 
144 hours (Fig. 4E), as seen in the previous re- 
ports (37). By contrast, An- and 345FAn-treated 
films show improved thermal stability, and 
345FAn-treated perovskites retained 85% of 
their initial brightness after annealing. 
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2D/3D heterostructure 


E, (eV) 


345FAn 


345FAn 


We fabricated inverted-structure PSCs to 
investigate how ammonium ligands influ- 
enced the efficiency and stability of PSCs. We 
used both CsoosMAoosFAogPbUogsBroos)3 and 
CSo.05MAp15FAg.sPbI3 perovskites as the ab- 
sorber, the self-assembled monolayer 2PACz 
as the hole transport layer, the thermally evap- 
orated Cg ./bathocuproine (BCP) bilayer as 
the electron transport layer, and indium tin 
oxide (ITO) as the transparent electrode (ma- 
terials and methods). Three representative am- 
monium ligands, including the 2D-forming 
PEA cation and the noninvasive An and 345FAn 
cations, were used for interfacial engineering. 
The cross-sectional scanning electron micro- 
scope (SEM) image of a complete device is 
shown in fig. S16. 

Device performance (Fig. 5A) showed that 
compared with control devices, PEA-treated 
CSo.05MAp15FAg.sPbI3 PSCs (16 devices) pro- 
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vided improved PV performance with an av- 
erage PCE of 23.2% (Fig. 5A and fig. S17). The . 
average PCE of An devices decreased to 19.9%, 
a finding we linked to perovskite decomposition. 
Because of the improved surface passivation, 
345FAn treatment increased average PCEs of 
CSoosMAoisFAosPbIz and Csoo5MAoosFAo Pb 
(o95BYo.05)3 PSCs to 23.3 and 22.9%, respectively 
(Fig. 5A and fig. S18). The external quantum 
efficiency (EQE) spectra of the champion cells 
(Fig. 5B) for PEA and 345FAn devices exhib- 
ited improved charge collection compared with 
controls, and their integrated short-circuit cur- 
rent density (J,.) values matched well with 
those from the current-voltage (I-V) sweep. 

We sent a 345FAn-treated Cspo5MAo15FAosPbI3 
device to National Renewable Energy Laboratory 
(NREL, USA) for independent characteriza- 
tion. The device delivered a certified PCE of 
24.09% by using a quasi-steady-state (QSS) I-V 
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Fig. 4. Thermal stability of perovskite films. (A and B) Pseudocolor 
representation of the transient reflectance spectra for PEA-treated perovskite 
films before (A) and after (B) thermal aging at 85°C for 2 hours. 2D perovskites 
were decomposed into Pbl2 after thermal aging. (C) The distribution of PEA* 
and An* in fresh and aged (at 85°C for 2 hours) perovskite films observed with 


sweep (Fig. 5C and fig. S19). The QSS-certified 
efficiency reported for inverted PSCs is com- 
pared with that of other devices in table S2. 
Seeking a better understanding of the im- 
proved performance, we used ultraviolet photo- 
electron spectroscopy (UPS) to characterize the 
electronic structure of the perovskite films. The 
secondary electron cutoff spectra (fig. S20) 
showed that 345FAn decreased the work func- 
tion of perovskites from 4.64 to 4.17 eV and 
shifted the valence-band maximum from 1.15 eV 
below the Fermi level in the control perov- 
skites to 1.50 eV below the Fermi level. These 
results show that 345FAn induced more n-type 
character in the perovskite film (fig. S20), which 
in inverted PSCs leads to favorable band bend- 
ing for electron extraction and a decrease in 
nonradiative carrier recombination (44, 45). 
We performed optoelectronic characteriza- 
tion to investigate carrier recombination. From 
photoluminescence quantum yield (PLQY) mea- 
surements, we identified nonradiative recom- 
bination at the perovskite/Cg, interface as the 
limiting factor for the performance of control 
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devices (fig. S21). This recombination loss cor- 
responded to a reduction of the quasi-Fermi 
level splitting (QFLS) by ~100 meV (table S3) 
(46, 47). We found that PEA and 345FAn pas- 
sivation could reduce the interfacial nonradiative 
recombination, as reflected by the improved 
QELS relative to the control stack, by ~15 and 
~30 meV, respectively. We further analyzed the 
energy loss of PSCs on the basis of the device 
diode characteristics (figs. S22 and S23) (48). 
Consistent with the PL studies, nonradiative 
losses were reduced for PEA and 345FAn 
devices (12.7 and 12.1% for PEA and 345FAn, 
respectively) compared with those of the con- 
trol device (16.5%) (fig. S24). 

We evaluated the effect of ligand reactivity 
on the material processing of PSCs. We varied 
the solution exposure time between 0 s (dy- 
namic spinning) and 120 s and tracked the PV 
parameters of PSCs accordingly (Fig. 5D and 
fig. S25). The average PCE of PEA devices (eight 
devices for each condition) dropped to 15.8%, 
with an average fill factor (FF) of merely 67.4% 
over the course of solution exposure for 30 s, 


TOF-SIMS. (D) PL spectra of perovskite thin films under thermal annealing at 
85°C. The films were kept on a hot plate at 85°C in the N--filled glovebox 
and measured every 24 hours in ambient air. (E) Changes in PL intensity for 
different annealing times. Data are presented as mean values + standard 
deviation. Three films were measured under each condition. 


whereas 345FAn devices maintained an average 
PCE of 23.0% (Fig. 5D). Wide solution-processing 
windows were also achieved for other non- 
penetrating ligands, such as 35tbuAn, 4FAn, 
and 26FAn (fig. S26). In pursuit of initial evi- 
dence of capacity to scale area, we fabricated 
perovskite solar modules (PSMs) with an active 
area of 22 cm? and nine interconnected subcells 
(materials and methods). The champion PCE 
under reverse voltage scan was improved from 
19.9 to 20.8% for the 345FAn-treated PSMs, 
with an open-circuit voltage (V,.) of 10.13 V, an 
FF of 74.2%, and a J, of 2.77 mA cm ° (fig. S27). 

We studied the operating stability of PSCs 
using ISOS protocols (49). Accelerated lifetime 
testing was performed at the ISOS-L-3-85°C 
level, constituting light-soaking tests at 50% 
RH and 85°C with MPP tracking. We used 
atomic-layer deposition for SnO, as the buffer 
layer and Cso05MAo05FAo 9Pb(Io.9sBro.05)3 Perov- 
skites as the absorber (materials and methods). 
The initial PCEs of the encapsulated control, 
PEA, and 345FAn devices were 18.9, 20.1, and 
20.2%, respectively (fig. S28). The stability 
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Fig. 5. Photovoltaic performance of PSCs with interface engineering. 

(A) PCE for control (8 devices) versus ammonium ligand-treated (16 devices 
for each type) PSCs. (B) EQE curves of the control and ammonium ligand— 
treated devices. (C) QSS current density-voltage (J-V) curve of one 
representative 345FAn device certified at NREL. (Inset) PV parameters of the 
device. (D) PCE evolution of PEA and 345FAn devices as a function of 
solution exposure time (8 devices for each condition). Data are presented as 


results are shown in Fig. 5E. We recorded a 
similar 79 of ~220 hours for the control and 
PEA devices, in accordance with the limited 
thermal stability of perovskite films (Fig. 3E). 
We found that 2D/3D PSCs based on alkyl am- 
monium BA and OA were even less stable than 
the control device (fig. $29). By contrast, the 
345FAn device retained >80% of its initial 
value after the 800-hour test. We estimate the 
Tgo for the 345FAn device to be ~810 hours (fig. 
$30), representing a fourfold enhancement 
over the operating lifetime of the PEA device. 
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To improve the 7g further, we moved to 
fluorine-doped tin oxide (FTO) as an alter- 
native transparent electrode (fig. S31) because 
of its chemical stability (50, 57). We monitored 
the MPP of the encapsulated device under 1-sun 
illumination at 50% RH and 85°C. The ini- 
tial PCE was 19.0%, which increased progres- 
sively to a peak value of 19.9% after ~50 hours 
of operation (Fig. 5F). This maximum PCE 
(PCEmax) corresponds to a temperature co- 
efficient of —0.12% /°C relative to the room 
temperature device PCE (21.5%), which is con- 


mean values + standard deviation. (Inset) Schematic illustration of the 

solution exposure process for interface engineering. (E) MPP tracking of 
encapsulated control, PEA, and 345FAn devices performed using ITO substrates 
at 85°C with an RH of ~50% under 0.8-sun illumination. (F and G) PCE (F), 
current density (Impp), and voltage (Vinpp) (G) at the MPP of the encapsulated 
345FAn device determined using an FTO substrate under 1-sun illumination 

at 85°C and ~50% RH. 


sistent with reported results for inverted PSCs 
(62). After 1586 hours of continuous operation, 
the PCE dropped to 16.8% (84% of the PCEmax), 
primarily because of a reduction in current 
density (Fig. 5G). We determined the 7g, to be 
~1560 hours (Fig. 5F). A comparison with other 
ISOS-L-3-stable PSCs is provided in table S4. 


Discussion 


Two types of 2D/3D heterostructures have 
been reported for ISOS-L-3-stable PSCs: 3FPEA- 
intercalated 2D/3D perovskites (23) and 
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all-inorganic heterostructures (72). For all- 
inorganic PSCs, the Ts; reached an impressive 
4000 hours at 85°C, but with further room for 
progress in PCE (<17% when measured at 
85°C). 3FPEA-based PSCs have achieved PCEs 
exceeding 23% at room temperature, but 
stable operation over 500 hours was limited 
to 65°C. 


Aided by spectroscopic techniques, includ- 


ing AR-XPS, TOF-SIMS, and TR, we see evi- 
dence here that certain 2D/3D heterostructures, 
such as those employing typical ammonium 
ligands including PEA, 3FPEA, BA, and OA, 
may evince thermal degradation, something 
we assign to the dynamic nature of the inter- 
faces at elevated temperatures. Anilinium and 
its fluorinated derivatives offer a more robust 
interface structure correlated with limited 
penetration into the bulk of perovskites. Flu- 
orinated An contributes interface passivation 
linked to strong interactions with perovskite 
surfaces. The resultant optimized 345FAn de- 
vices are among the most stable PSCs seen 
under ISOS-L-3 protocols at 85°C, achieving 
PCE of 19.9% during MPP tracking at this 
elevated operating temperature. 


REFERENCES AND NOTES 


1. 


Park et al., Science 381, 209-215 (2023) 


N. M. Haegel et al., Science 364, 836-838 (2019). 

National Renewable Energy Laboratory, Best research-cell 
efficiency chart, revised 26 May 2023; https://www.nrel.gov/ 
pv/cell-efficiency.html. 

H. Min et al., Nature 598, 444-450 (2021). 

M. Kim et al., Science 375, 302-306 (2022). 

J. Jeong et al., Nature 592, 381-385 (2021). 

Z. Song et al., Energy Environ. Sci. 10, 1297-1305 (2017). 

Y. Zhao et al., Nat. Energy 7, 144-152 (2022). 

S. Bai et al., Nature 571, 245-250 (2019). 

Q. Cao et al., Sci. Adv. 7, eabg0633 (2021). 

Y. Liu et al., Angew. Chem. Int. Ed. 59, 15688-15694 (2020). 


. Y.-H. Lin et al., Science 369, 96-102 (2020). 
. X. Zhao et al., Science 377, 307-310 (2022). 
. R. Lin et al., Nature 603, 73-78 (2022). 


14 July 2023 


52. 


. X. Li et al., Science 375, 434-437 (2022). 
. L. Chao et al., Research 2020, 2616345 (2020). 
. H. Kim et al., Nat. Commun. 11, 3378 (2020). 


H. Zhu et al., Adv. Mater. 32, 1907757 (2020). 


. S. Yang et al., Science 365, 473-478 (2019). 
. J. Warby et al., Adv. Energy Mater. 12, 2103567 (2022). 
. S. M. Park, A. Abtahi, A. M. Boehm, K. R. Graham, ACS Energy 


Lett. 5, 799-806 (2020). 


. J. J. Yoo et al., Energy Environ. Sci. 12, 2192-2199 (2019). 

. J. J. Yoo et al., Nature 590, 587-593 (2021). 

. H. Chen et al., Nat. Photonics 16, 352-358 (2022). 

. A. H. Proppe et al., Nat. Commun. 12, 3472 (2021). 

. S. Sidhik et al., Science 377, 1425-1430 (2022). 

. Y. Liu et al., Sci. Adv. 5, eaaw2543 (2019). 

. J. Chakkamalayath, N. Hiott, P. V. Kamat, ACS Energy Lett. 8, 


169-171 (2023). 


. C. A. R. Perini et al., Adv. Mater. 34, 2204726 (2022). 

. A. A. Sutanto et al., Nano Lett. 20, 3992-3998 (2020). 

. R. Azmi et al., Science 376, 73-77 (2022). 

. Q. Jiang et al., Nat. Photonics 13, 460-466 (2019). 

. C. Liu et al., Nat. Commun. 12, 6394 (2021). 

. S. Kumar, L. Houben, K. Rechav, D. Cahen, Proc. Natl. Acad. 


Sci, U.S.A. 119, 2114740119 (2022). 


. M. M. Tavakoli et al., Energy Environ. Sci. 11, 3310-3320 (2018). 
. S.-H. Lee et al., ACS Energy Lett. 6, 1612-1621 (2021). 

. C. S. Fadley, Prog. Surf. Sci. 16, 275-388 (1984). 

. B. Ly, T. Qian, H. Ding, Nat. Rev. Phys. 1, 609-626 (2019). 

. Y. Wei et al., Adv. Energy Mater. 9, 1900612 (2019). 


D. Bi et al., Nat. Commun. 9, 4482 (2018). 


. H. Zhu et al., J. Am. Chem. Soc. 143, 3231-3237 (2021). 

. M. A. Hope et al., J. Am. Chem. Soc. 143, 1529-1538 (2021). 
. Y. Yang et al., Nat. Energy 2, 16207 (2017). 

. R. Lyu, C. E. Moore, T. Liu, Y. Yu, Y. Wu, J. Am. Chem. Soc. 143, 


2766-12776 (2021). 


. D. Luo et al., Science 360, 1442-1446 (2018). 

. Q. Jiang et al., Nature 611, 278-283 (2022). 

. M. Stolterfoht et al., Adv. Mater. 32, 2000080 (2020). 
. M. Stolterfoht et al., Nat. Energy 3, 847-854 (2018). 

. H. Zhang et al., Nat. Commun. 12, 3383 (2021). 


M. V. Khenkin et al., Nat. Energy 5, 35-49 (2020). 


. J. D. Benck, B. A. Pinaud, Y. Gorlin, T. F. Jaramillo, PLOS ONE 


9, 6107942 (2014). 


. R.A. Kerner, B. P. Rand, ACS Appl. Energy Mater. 2, 6097-6101 


(2019). 
T. Moot et al., ACS Energy Lett. 6, 2038-2047 (2021). 


ACKNOWLEDGMENTS 


Funding: This research was made possible by the US Department 
of the Navy, Office of Naval Research (grant NOO0014-20-1-2572). 
This work was supported in part by Ontario Research Fund: 
Research Excellence program (ORF7-Ministry of Research and 
Innovation, Ontario Research Fund: Research Excellence Round 7). 


Agreement 


Agreement 


Author contributions: S.M.P., M.W., K.R.G., M.G., and E.H.S. 
conceived the idea and proposed the experimental and model 
design. S.M.P. and M.W. fabricated all the devices and conducted 


his work was also supported under award OSR-CRG2020-4350.2. 
S.M.P., H.R.A., and K.R.G. gratefully acknowledge the US 
Department of Energy, Office of Science, Office of Basic Energy 
Sciences, and the EPSCoR program, under award DE-SCO018208 
or XPS measurements. K.R.R., Z.C., J.T.P., C.M.R., and K.R.G. 
acknowledge funding from the NSF through Cooperative 

849213. Supercomputing resources on the Lipscomb 
High Performance Computing Cluster were provided by the 
University of Kentucky Information Technology Department and 


Center for Computational Sciences (CCS). M.W. acknowledges 


unding from the European Union's Horizon 2020 Research and 
nnovation program under Marie Sktodowska-Curie Grant 
01026353. T.W. and A.A. acknowledge the support of 
he NSF under award ECCS-1936527. A.A. acknowledges partial 


support by the Office of Naval Research under award NOO014-20- 


-2573. This work made use of the NUFAB and Keck-ll facilities 


of Northwestern University's NUANCE Center, which has received 
support 


rom the SHyNE Resource (NSF ECCS-2025633), the 
N, and Northwestern's MRSEC program (NSF DMR-1720139). 


he characterization. S.M.P and H.R.A performed XPS and UPS 


characterization and data analysis. J.X., K.R.R., and C.M.R. 
conducted the DF 


simulation. Y.Y., C.L., and M.G.K. measured 
OF-SIMS and fabricated PSMs. F.T.E., M.W., S.M.Z., and M.G. 


conducted the PL characterization and data analysis. K.D., 


.W., and A.A. performed the GIWAXS measurements. S.T. and 


S.M.P. measured the TR spectra. L.G., B.C., H.C., L.Z., A.M., 
and Z.W. helped with the device fabrication and material 
characterization. Z.C. and J.T.P. performed the contact-angle 
measurements. M.W., 
the manuscript. All authors contributed to data analysis, read, 
and commented on the manuscript. Competing interests: The 
authors declare that they have no competing interests. Data 
and materials availability: All data are provided in the main 
text or the supplementary materials. License information: 
Copyright © 2023 the authors, some rights reserved; exclusive 
licensee American Association for 
claim to original US government works. https://www.science.org/ 
about/science-licenses-journal-article-reuse 


S.M.P., J.X., K.R.G., M.G., and E.H.S. cowrote 


he Advancement of Science. No 


SUPPLEMENTARY MATERIALS 


science.org/doi/10.1126/science.adi4107 
Materials and Methods 

Supplementary Text 

Figs. S1 to S31 

Tables S1 to S4 

References (53-63) 


Submitted 24 April 2023; accepted 6 June 2023 
10.1126/science.adi4107 


7 of 7 


. 


¢ 


RESEARCH 


WOOD ENGINEERING 


Multiplex CRISPR editing of wood for sustainable 


fiber production 


Daniel B. Sulis'?*, Xiao Jiang*, Chenmin Yang’*°, Barbara M. Marques”, Megan L. Matthews>®, 
Zachary Miller*, Kai Lan*, Carlos Cofre-Vega~*, Baoguang Liu>?*”, Runkun Sun’, 

Henry Sederoff2, Ryan G. Bing®, Xiaoyan Sun®, Cranos M. Williams®°, Hasan Jameel‘, 

Richard Phillips*, Hou-min Chang‘, Ilona Peszlen*, Yung-Yun Huang”°, Wei Li”, Robert M. Kelly®, 
Ronald R. Sederoff'?, Vincent L. Chiang'?™, Rodolphe Barrangou”*"*, Jack P. Wang??>* 


The domestication of forest trees for a more sustainable fiber bioeconomy has long been hindered 

by the complexity and plasticity of lignin, a biopolymer in wood that is recalcitrant to chemical 

and enzymatic degradation. Here, we show that multiplex CRISPR editing enables precise woody 
feedstock design for combinatorial improvement of lignin composition and wood properties. By 
assessing every possible combination of 69,123 multigenic editing strategies for 21 lignin biosynthesis 
genes, we deduced seven different genome editing strategies targeting the concurrent alteration 

of up to six genes and produced 174 edited poplar variants. CRISPR editing increased the wood 
carbohydrate-to-lignin ratio up to 228% that of wild type, leading to more-efficient fiber pulping. 
The edited wood alleviates a major fiber-production bottleneck regardless of changes in tree growth 
rate and could bring unprecedented operational efficiencies, bioeconomic opportunities, and 


environmental benefits. 


ood is the most abundant carbon bio- 

mass on Earth and the major source 

of sustainable green fibers (1). Glob- 

ally, =315 billion metric tons of car- 

bon are stored as wood, representing 
=57% of the biogenic carbon sink (J). The bio- 
mass supplies >170 million metric tons of 
virgin fibers annually (2) to meet the growing 
demand for renewable tissue, paper, pack- 
aging, textile, and other fiber products, in- 
cluding structural materials (3). Despite the 
importance of wood fibers, their production 
has remained largely limited to undomes- 
ticated forest trees with often suboptimal 
wood properties that hamper production 
efficiency. The propensity for efficient isola- 
tion of cellulosic fibers from wood is largely 
determined by the content and composi- 
tion of lignin (4-6), one of three major com- 
ponents of wood (7). Lignin is a phenolic 
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polymer that cross-links with cellulose and 
hemicelluloses in the secondary cell walls 
of vascular plants (8, 9). The polymer in an- 
giosperm wood is formed by free radical 
polymerization of three major monolignol 
precursors, 4-coumaryl, coniferyl, and sinapyl 
alcohols, which form the p-hydroxyphenyl 
(H), guaiacyl (G), and syringyl (S) units in 
lignin (10-12). The monolignols are biosyn- 
thesized from phenylalanine through a se- 
ries of enzymatic reactions in a metabolic 
grid consisting of at least 11 enzyme families 
and 24 metabolites (73). The pathway is me- 
diated by hundreds of regulatory influences, 
encompassing transcriptional (/4), transla- 
tional (15), and posttranslational regulations 
(16, 17); enzyme-enzyme interactions (18-20); 
and metabolic regulations (21, 22). More 
than five decades of research have exten- 
sively investigated the individual components 
of lignin biosynthesis and determined the 
effects of their perturbation on lignin con- 
tent and composition in diverse plant spe- 
cies (6, 13, 23). However, these efforts have 
predominantly focused on the modification 
of single genes or gene families, whereas the 
combinatorial effects of multigenic pertur- 
bations have remained elusive. Here, we show 
that strategic multiplex CRISPR editing of 
monolignol biosynthesis genes improves wood 
properties beyond what can be achieved by 
editing single genes or gene families and de- 
bottlenecks a key operational constraint in 
industrial pulp mills. These improvements sub- 
stantially increase fiber production capacity 
while reducing the global warming potential 
of pulp mills, leading to a more sustainable 
and efficient fiber bioeconomy. 


q 


Chec 


Multiplex genome editing strategies for acne 


concurrent wood trait improvement 
Gene targets for multiplex genome editing 
were identified using our established pre- 
dictive model for monolignol biosynthesis 
(13, 15, 22, 24). The model predicts the trans- 
duction of quantitative relationships from 
gene transcript abundances to absolute en- 
zyme abundances, pathway metabolic fluxes, 
and 25 wood chemical and physical properties. 
Using the predictive model, we explored 
69,123 sets of multigenic editing strategies 
to reveal the extent to which individual 
wood properties can be modified through 
multiplex editing of monolignol genes (Fig. 1, 
A and B). The 69,123 strategies encompass 
48,831 gene combinations that represent all 
possible permutations of loss-of-function edit- 
ing of up to six genes, and 20,292 gene com- 
binations that target the combinatorial editing 
of up to three genes and/or overexpression of | 
one gene (data S1A). For each gene combina- 
tion, the model estimated the corresponding 
abundances of monolignol enzymes, pathway 
metabolic fluxes, and 25 wood properties. Tar- 
geting multiple genes expanded the range of 
phenotypic variation attainable by genome 
editing compared with single-gene edits (Fig. ‘ 
1C and data SIB). For example, multigenic 
editing could reduce lignin content to 50.7% 
of the wild-type level, whereas single-gene 
edits only reduced lignin content to 61.3% of * 
the wild-type level (Fig. 1C and data S1B). 
Wood with low lignin content and a high 
syringyl-to-guaiacyl (S/G) ratio is ideal for 
fiber production (25). We mined all 69,123 
strategies to identify gene combinations that 
are predicted to reduce lignin content by at 
least 15% and increase the carbohydrate-to- 
lignin (C/L) ratio by at least 200% and that 
have a higher S/G ratio than wild type. The 
C/L ratio is an indicator of the potential max- + 
imum cellulosic yield for wood fiber (73). The ‘ 
strategies must also have predicted growth 
characteristics (e.g., tree height) that are com- 
parable to (>75%) or exceed those of wild-type . 
controls. Of the 69,123 strategies, only 347 
(0.5%) matched the aforementioned criteria in 
lignin content, C/L and S/G ratios, and tree 
growth (Fig. 1B and data SIC), highlighting 
the need to mine such strategies to practically 
test the most-promising combinations. All 347 
strategies target at least two monolignol genes, 
and 99.7% of the strategies target at least three 
genes (Fig. 1D and data S1C). The number of 
target genes affirms the need for a multigenic 
approach to improve fiber traits in Populus 
trichocarpa. Lignin gene families that most 
frequently appeared in these 347 strategies 
were C3H (coumarate 3-hydroxylase), CCoaAOMT 
(caffeoyl-CoA O-methyltransferase), AldOMT 
(aldehyde O-methyltransferase), PAL (phenyl- 
alanine ammonia-lyase), C4H (cinnamate 
4-hydroxylase), and CAD (cinnamy] alcohol 
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Fig. 1. Identification of multiplex CRISPR editing strategies to improve wood fiber traits. 

(A) Three-dimensional scatterplot of predicted lignin content, S/G ratio, and tree growth (stem 
diameter) for 69,123 genome editing strategies. (B) Venn diagram of the 69,123 strategies showing 
the relationships between predicted lignin content, S/G ratio, C/L ratio, and tree growth (height). 

(C) Distribution of lignin content in single-gene and multigene editing strategies. (D) The total number 
of target genes in the selected 347 strategies. (E) The frequency of each monolignol gene in the 

347 strategies. (F to 1) Global sensitivity analysis of wild-type (a), single-gene editing (b to d), and 
multigene editing (e to k) strategies for lignin content (F), S/G ratio (G), C/L ratio (H), and tree growth 
(height) (I). For additional information, see fig. S1. 
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dehydrogenase) (Fig. IE). We then performed 
sensitivity analysis for these gene families to 
examine the robustness of fiber trait improve- 
ments in response to varying transcript abun- 
dances of nontargeted monolignol genes 
(Fig. 1, F to I; fig. S1; and data S2 and S3) 
and selected seven strategies for editing in 
P. trichocarpa. The seven strategies (table S1) 
encompassed various numbers of target genes 
(from three to six) and were selected on the 
basis of the extent and robustness of predicted 
improvement in fiber traits (reduced lignin, 
increased S/G and C/L ratios, and good growth) 
(data S2 and S3). The selected strategies 
showed a predicted reduction in lignin con- 
tent of up to 35% compared with wild type 
(Fig. 1F and data S3). S/G and C/L ratios in- 
creased up to 248 and 215%, respectively (Fig. 
1, G and H, and data S3), with no change in 
tree height (Fig. 11 and data S3). 


Generating genetic diversity in poplar 
using CRISPR 


To test the seven strategies in planta for their 
modulation of wood properties for fiber pro- 
duction, a multiplex CRISPR construct (26) 
was assembled for each strategy and delivered 
into P. trichocarpa using Agrobacterium 
tumefaciens (27) (figs. S2 to S4, data S4, 
and tables $2 and S3). We generated 174 inde- 
pendent lines of multiplex CRISPR-edited 
P. trichocarpa, including 78 lines that target 
the concurrent editing of three monolignol 
genes, 20 lines that target four genes, 41 lines 
that target five genes, and 35 lines that target 
six genes. The edited lines exhibited varying 
extents of loss-of-function mutations of the 
target genes (Fig. 2A, fig. $5, and data S5). 
Biallelic loss-of-function editing of all target 
genes was obtained in P. trichocarpa for strat- 
egies that target the concurrent editing of 
three or four genes (Fig. 2A, fig. S5, and data 
S5). For strategies that target five or six genes, 
none of the 76 edited lines showed complete 
loss-of-function editing of both alleles in all 
target genes. Nonetheless, we identified sev- 
eral lines (e.g., J-7 and K-28) that harbor sub- 
stantial editing of most target genes (fig. S5 
and data S5). The frequency of biallelic edits 
varied between target genes, ranging from 
86.1% for PtrPAL4 and PtrPAL5 to 9.3% for 
PtrCCoAOMT?2 (Fig. 2B). Most edits consisted 
of small insertions and deletions (INDELs) 
of 1 to 3 nucleotides (nt) in the immediate 
vicinity of the targeted cleavage site (Fig. 2, C 
and D, and fig. S6). However, deletions of up 
to 19 nt were also occasionally observed (Fig. 
2D). No off-target edits were detected in our 
edited lines (table $4). We used RNA sequencing 
to evaluate two potential xylem-expressing off- 
target genomic loci in 12 edited P. trichocarpa 
lines (data S6). The absence of detectable off- 
target editing suggests that genome editing 
is highly specific in P. trichocarpa. The broad 
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Fig. 2. Genetic diversity in multiplex CRISPR-edited P. trichocarpa. 
(A) Percentage INDELs of each target gene in the 58 CRISPR-edited P. trichocarpa 


lines targeting the concurrent editing of three monoligno 


and PtrPAL5). (B) Average percentage INDELs of the target monolignol genes in 
ent CRISPR-Cas—mediated 


the 174 edited P. trichocarpa lines. (© and D) Most-frequ 


variation in the editing profiles of target genes 
in our 174 edited lines created genetic diversity 
in monolignol biosynthesis not present in 
nature (Fig. 2, E to H). Such genetic diversity 
enables an exploration of how multigenic path- 
way perturbations can combinatorially regulate 
wood formation and wood utilization, as well 


as enhance genetic diversity. 
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Phenotypic characterization of multiplex 
CRISPR-edited poplar 

Multiplex editing of monolignol genes in pop- 
lar substantially altered the chemical and 
physical properties of the wood (Fig. 3, A to D; 
fig. S7; and data S7 to S10), as predicted by the 
lignin model (fig. S8 and table S5). Stem wood 
of 6-month-old wild-type P. trichocarpa con- 


PtrC3H3| SO - -- = - 


PtrAICOMT2 Se --- - -- 


5’ — GGAATTCTGTCAGGACTCATGACACEGG — 3’ 


5’ — GGAATTCTGTCAGGACTCATGGCACRGS — 3’ 


5’ — GGAATTCTGTCAGGACTCATG - g-3’ 
5’ — GGAATTCTGTCAGGA - CACHSG - 3’ 
5’ — GGAATTCTG - CACHE -— 3’ 


Spin (*s 


It 


iy 


0 20 40 60 80 100 
INDELS (%) 
C D 
Reference PtrPAL2 PAM Reference PtrPAL4|5 PAM 


5’ — AACACTAATGACCCTTTGAA- CTGGIGIEC — 3’ 


Edited alleles 

5’ — AACACTAATGACCCTTTGA- - CTGEIGGC — 3’ 
5’ — AACACTAATGACCCTTTGAA- - TG@IGGC — 3’ 
5’ — AACACTAATGACCCTT: - CTGGGEc - 3’ 
5’ — AACACTAATGACCCTTTGAAACTGISGGC — 3’ 
5’ — AACACTA c-3’ 


5’ — AACACTAATGACCCTTT - CTGGIBEC - 3’ 


Line H-19-3 


i 


mutations in PtrPAL2 (C) and PtrPAL4 and PtrPAL5 (D). Stem cross sections of 
wild-type (E) and CRISPR-edited H-19-3 trees (F), with the H-19-3 tree showing 
distinct red coloration of the xylem. (G) Six-month-old greenhouse-grown 
CRISPR-edited and wild-type P. trichocarpa. (H) Harvested stem segments from 
CRISPR-edited and wild-type P. trichocarpa. 


tains 20.9% lignin and has a C/L ratio of 3.0 
(data S7). In the multiplex-edited wood, lignin 
content is reduced by up to 49.1%, and the C/L 
ratio increased up to 228%, compared with wild 
type (Fig. 3, Eand F, and data S7, determined 
as averages across 65 lines). Two-dimensional 
nuclear magnetic resonance spectroscopy 
(2D NMR) (Fig. 3, G to J, and figs. S9 to S11) 
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Fig. 3. Phenotypic variation in CRISPR-edited and wild-type P. trichocarpa. 

(A to D) Distribution of lignin content (A), C/L ratio (B), S/G ratio (C), and tree growth 
(height) (D) in CRISPR-edited and wild-type P. trichocarpa. Blue and red colors 

stand for CRISPR-edited lines and wild-type control measurements, respectively. 
Pink represents the overlapped regions of the blue (CRISPR-edited lines) and red 
(wild-type controls) bars. Lignin, C/L ratio, and growth data are from E, F, G, H, |, J, 
and K constructs. S/G ratio data are from E, F, and G constructs. Average lignin 


traits. Error bars represent one standard error of up to three biological replicates. 
(G to J) 2D NMR “C-H (heteronuclear single quantum coherence) correlation 
spectra (side-chain and aromatic regions) of wild type [(G) and (H)] and the 
G-27 edited line (see fig. S8 for other edited lines). Contours in these regions 
were used to estimate the distribution of lignin interunit linkages and lignin 
composition, namely S/G/H ratios, as well as p-hydroxybenzoate (PB). ppm, 
parts per million. (K) Quantitative relationship between lignin content and 


content (E) and C/L ratio (F) of selected CRISPR-edited 


revealed that multigenic strategies also modu- 
lated lignin composition, increasing the S/G 
ratio from 2.7 in wild type to as high as 4.0 in 
the edited lines (Fig. 3C and data S8). As 
expected, lines harboring single-gene edits 
showed either no reduction or a mild reduc- 
tion in lignin content (0.1 to 12.6%) (e.g., lines 
G-7, I-17, and K-9; fig. S12 and data S5 and S7). 
The most significant lignin reductions were 
observed in edited trees that harbor four to six 
gene edits, but strategies that targeted three 
genes also showed significant lignin reduc- 
tions of up to 32% (Fig. 3E, fig. S12, and data 
87). Edited lines with significant changes in 
wood chemical properties predominantly har- 
bored a single-allele loss-of-function mutation 
in PtrC3H3 combined with the mutation of 
either one or both alleles of PtrAldOMT2 and 
PtrCADI1 (fig. S12). Wood elasticity was not 
significantly different between wild-type and 
genome-edited P. trichocarpa, except for two 
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lines (E-9 and G-25), which had reduced wood 
elasticity (fig. S7 and data S9). No change in 
wood density greater than 15.1% (detection 
limit due to the large technical and biological 
variation; see materials and methods in the 
supplementary materials) of the wild-type 
level was observed in the edited P. trichocarpa 
(data S9). Several CRISPR-edited lines with 
significant reductions in lignin content showed 
a reduction in tree growth (e.g., I-1, J-25, and 
K-6) (Fig. 3, D and K; figs. S7 and S12; and data 
S7 and S10). Some multiplex genome editing 
strategies can circumvent certain shortcom- 
ings inherent to typical single-knockout strat- 
egies that often have undesirable phenotypic 
effects on combinations of tree characteristics 
and wood properties. Moving forward, it will 
be necessary to monitor the impact of the 
genome editing outcomes on the tree pheno- 
typic properties and traits relevant to the in- 
dustry in long-term field trials. 


wood volume in CRISPR-edited and wild-type P. trichocarpa. 


Projected industrial benefits of 
debottlenecking fiber production 

Kraft pulping is the dominant process for in- 
dustrial wood fiber production (28-30). To 
understand the techno-economic impacts of — 
CRISPR-edited wood on kraft pulping, we 
used a Carolina Pulp and Paper (CPP) mill 
model (fig. S13) based on an existing industrial 
Brazilian pulp mill with an annual production 
of 1.24 million tons of pulp. Pulp yield and 
production efficiency may be significantly in- 
creased by multiplex editing of monolignol 
genes (Fig. 4, A and B), as evidenced by mi- 
cropulping wood with varying lignin content 
and composition. Reducing lignin content 
proportionally increases pulp yield and could 
reduce the usage of pulping chemicals (Fig. 4, 
A and B; figs. S14 to S17; and tables S6 to S8) 
that are undesirable from an environmental 
standpoint. Furthermore, low-lignin wood 
could reduce the solid content of black liquor, 
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industrial kraft pulp mill using wood with 28% lignin (D) or 16% lignin (E) at an S/G ratio of 2.8. (F) Total life-cycle GWP of producing pulp from 1 BDMT log. 
The scenarios include varied S/G ratios (2.8, 4.0, or 6.0), lignin content (28 or 16%), and production modes (constant or incremental). 


thereby debottlenecking the recovery boiler, 
arguably the most crucial and rate-limiting 
energetic component of pulp mills (Fig. 4, C 
to E, and fig. S18). The debottlenecking then 
enables an incremental production potential 
of the pulp mill by up to 40% (Fig. 4, C to E, 
and fig. S18). Additionally, increasing the C/L 
ratio in wood means that less biomass is 
required to provide the same amount of cel- 
lulose (fig. S17). The benefits of increasing the 
S/G ratio encompassed the reduction of pulp- 
ing chemicals and energy savings on chemical 
recovery (figs. S16 and S18). Tremendous fi- 
nancial benefits may be obtained by reducing 
lignin from 28% to 16% and by increasing the 
S/G ratio from 2.8 to 4.0 (figs. S19 to S21). Such 
changes are predicted to increase the net pres- 
ent value (at a 14% hurdle rate) from —$359.3 
million to $1047.1 million USD for pulp mills 
using natural gas as supplementary energy, 
and from $269.2 million to $1856.9 million 
USD for mills using biomass as supplemen- 
tary energy (figs. S19, C and D, S20, and S21). 
Notably, some of these benefits are achieved 
even in cases of biomass reduction because 
of industrial operational efficiency gains (figs. 
$20 and S21). Although the magnitude of the 
financial gains is forecasted, accuracy will 
eventually be determined by the physiolog- 
ical attributes of the harvested trees at in- 
dustrial scale. 
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Sustainability impact of optimized 

fiber production 

The pulp and paper industry is a major con- 
tributor to greenhouse gas emissions. The US 
Environmental Protection Agency reported 
35 million metric tons of CO, equivalent emis- 
sions from the pulp and paper industry in the 
US in 2020 (3D. Globally, the annual direct CO, 
emissions from pulp and paper production 
reached 168 million metric tons (32). We as- 
sessed the impacts of CRISPR-edited wood on 
the carbon footprint of pulp production. Life- 
cycle global warming potential (GWP) was 
estimated for pulp production from 1 bone dry 
metric ton (BDMT) of CRISPR-edited wood 
and compared with that of wild-type wood. 
For a pulp mill powered by natural gas, reduc- 
ing lignin content from 28% to 16% decreases 
the GWP by up to 8% (Fig. 4F, figs. S22 to S26, 
and tables S9 and S10), and increasing the S/G 
ratio from 2.8 to 4.0 leads to a further 10% 
reduction in the GWP per BDMT of logs pro- 
cessed (Fig. 4F, figs. S22 to $26, and tables S9 
and S10). Altogether, incremental pulp produc- 
tion using edited wood with reduced lignin 
and an increased S/G ratio could reduce GWP 
by up to 20% (Fig. 4F, figs. S22 to S26, and 
tables S9 and S10), providing substantial bene- 
fits to environmental conservation and climate 
change mitigation. These results provide in- 
sights into the potential of genome editing 


technologies for tree improvement, which may 
be further corroborated in field trials in rele- 
vant environmental conditions. This study 
illustrates how strategic multiplex editing to 
alter wood composition could enable more- 
sustainable fiber production with remarkable 
operational efficiencies, bioeconomic value cre- 
ation, and tangible environmental benefits. 
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Multichip multidimensional quantum networks with 
entanglement retrievability 
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Zhihua Li®, Yan Li?”*°, Qihuang Gong'”*°, Hon Ki Tsang®, Daoxin Dai*?*, Jianwei Wang?”®:°* 


Quantum networks provide the framework for quantum communication, clock synchronization, 
distributed quantum computing, and sensing. Implementing large-scale and practical quantum networks 
relies on the development of scalable architecture and integrated hardware that can coherently 
interconnect many remote quantum nodes by sharing multidimensional entanglement through complex- 
medium quantum channels. We demonstrate a multichip multidimensional quantum entanglement 
network based on mass-manufacturable integrated-nanophotonic quantum node chips fabricated on a 
silicon wafer by means of complementary metal-oxide-semiconductor processes. Using hybrid 
multiplexing, we show that multiple multidimensional entangled states can be distributed across multiple 
chips connected by few-mode fibers. We developed a technique that can efficiently retrieve 
multidimensional entanglement in complex-medium quantum channels, which is important for practical 
uses. Our work demonstrates the enabling capabilities of realizing large-scale practical chip-based 


quantum entanglement networks. 


uantum networks aim to transmit, store, 
and process quantum information in a 
mesh of quantum nodes coherently 
linked by quantum channels, going be- 
yond the capabilities of classical inter- 
net (/, 2). It promises many applications, such 
as quantum key distribution (QKD) for secure 
communications, higher-precision clock syn- 
chronization, distributed quantum metrol- 
ogy, and distributed quantum computation. 
The key to upgrading current QKD networks 
(3-5) is realizing entanglement networks that 


A = 
node chip 1 


z= 


Fig. 1. Architecture for multichip multidimensional entanglement networks. 
(A) Graphic representation of a fully connected entanglement network. It consists 
of one central server chip, n quantum node chips (vertices), n multimode 
quantum channels (red edges), and quantum correlations (gray edges). All node 
chips can be fully connected by sharing n(n - 1)/2 pairs of d-dimensional entangled 
photons with different wavelengths, initially produced on the server chip. Each 
multimode channel carries n — 1 wavelength-multiplexed single photons, to be 
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can robustly share complex entangled states 
among a large number of remote quantum 
nodes (J, 2). 

Realizing large-scale and practical quantum 
entanglement networks remains experimen- 
tally challenging. It requires scalable quantum 
hardware, technologies, and architecture for 
the distributions of entangled states among 
a large number of quantum nodes through 
high-capacity quantum channels. Quantum 
photonic integrated circuits are an emerging 
hardware system for quantum information 
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signal: 


multimode-encoding 


processing and communication (6-8). U 
integrated photonic chips, to date, poin.--- 


a 


Chec 
dé 


point QKD (9-13) and entanglement distri- 
bution (14, 15) have been demonstrated in a 
single chip-to-single chip system, and single 
microring resonators have been used as photon 
sources for QKD and networks (J6-18). Com- 
prehensive integrated quantum nodes in networks 
require full-spectrum monolithic integration 
of devices for encoding, decoding, multiplexing, 
operation, and detection of quantum states of 
light. In contrast to single-chip or single chip- 
to-single chip quantum experiments, realizing 
multichip quantum networks remains challeng- 
ing because it not only requires the monolithic 
integration of different devices but requires high 
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received and processed at one node chip. In the experiment, we prototyped an 
entanglement network having n = 3 and d = 4. (B) Hybrid encoding and multiplexing 
of entangled photons. The entangled photons are wavelength-demultiplexed from a 
broadband source and then remultiplexed for entanglement distributions through 
MMFs between node chips. At each wavelength, the d-dimensional quantum 
states are path-encoded on chip, which are coherently transformed to hybrid 
polarization mode-encoding states in fibers. 
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Fig. 2. Integrated silicon-photonic quantum networking chips. (A) Photo- 
graph for a 200-mm silicon-on-insulator (SOI) wafer that contains 27 copies of 
networking chips. The cyan box indicates a single copy, including a server chip 
and a node chip. (B) Circuit diagram. The network consists of a server chip that 
produces three pairs of four-dimensional entangled photons and three node 
chips (A, B, and C) connected by three separate FMFs. The triangular linear 
optical circuits of V4 and U4 are fully programmable to implement arbitrary 
unitary operations for the QER algorithm. M, on the node chips allows arbitrary 
projective measurement. Green parts indicate reconfigurable beamsplitters, and 
yellow parts indicate thermo-optic phase shifters. (©) Measured biphoton 


quantum indistinguishability between different 
chips for scalable quantum networking. Mean- 
while, practical quantum networking requires 
high-capacity and noise-resilient transmission 
of entanglement. Multidimensional quantum 
systems are known as one of the excellent 
candidates, which however are susceptible 
to environmental perturbations when travers- 
ing through complex media such as multi- 
mode fiber (MMF) or air-scattering channels 
(19, 20). As a result, state scrambling and en- 
tanglement degeneration inevitably take place. 
Previous works have demonstrated the capa- 
bilities of unscrambling classical (27, 22) and 
quantum (23, 24) light in complex media by 
using a measure-and-inverse technique, which 
requires the reconstruction of transmission 
matrix and then the inverse operation to recover 
the states. Another all-optics unscrambling 
method has been recently proposed (25) and 
demonstrated for unitary transmission of 
classical light (26). Efficiently retrieving quan- 
tum states of light in complex media is of 
practical interest in real-world quantum net- 
working systems. Moreover, at the architecture 
level, quantum networking devices and tech- 
nologies are expected to sustain a large number 
of quantum nodes and high-capacity quantum 
channels and to be compatible with those well- 
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has been subtracted. 


developed ones in classical telecommunication 
networks, such as multiplexing devices and 
technologies. 

In this work, we demonstrated multichip 
multidimensional entanglement networks. 
Our quantum networking chips were fabricated 
on silicon nanophotonic quantum circuits by 
using mass-manufacturable technologies. Three 
pairs of multidimensional entangled photons 
were generated on a server chip and coher- 
ently distributed across three quantum node 
chips by few-mode fibers (FMFs). We devel- 
oped a quantum entanglement retrieving (QER) 
method to overcome the nonunitary mode 
scrambling and entanglement degradation, 
efficiently recovering multidimensional entan- 
gled states distributed across the multichip 
networks. 


Networking architecture and hybrid 
multiplexing 


The network is enabled by hybrid encoding 
and multiplexing of the photon’s wavelength, 
path, mode, and polarization states (Fig. 1A). 
The network can be architected by a complete 
graph with n vertices (quantum node chips) 
and n(n -1)/2 edges (quantum correlations). 
It can be physically realized as follows: A 
server chip simultaneously produces n(n -1)/2 


spectrum of integrated sources. The grayed area indicates the wavelength 
window in this experiment. (Inset) An energy diagram for the nondegenerate 
SFWM process. (D) Measured spectra of coupled MRRs. Four sets of 
coupled MRRs (12 in total, colored) were measured, showing high-level 
spectral overlapping and low channel cross-talk. (E and F) Characterizations 
of the chip-to-fiber system. The LP modes were selectively excited and 
measured. (E) Simulated and measured LP mode distributions at the FMF 
output facet. (F) Measured spectral response for these modes. The spectral 
response for one-dimensional grating couplers (used for light injection) 


pairs of d-dimensional entangled photons 
(\@f) = 1/Va) | \k)|R), k =0,1,...,d-1)at 
different wavelengths [1.,,, 7 = 1, 2, ..., n(n - 1)/ 
2] and shares them through n MMF quantum 
channels. In each channel, 7 - 1 sets of single 
photons are wavelength-multiplexed to form 


quantum correlations (27, 28) between paired - 
chips, and each photon with the dimension of ‘ 


dis hybridly polarization-mode encoded (Fig. 
1B). The simultaneous distribution of entan- 


glement in the network is enabled through . 


wavelength-multiplexing parallelism, and the 
transmission of multidimensional entangle- 
ment between chips is enabled by coherently 
converting the on-chip multipath-encoding 
to the linearly polarized (LP) multimode encod- 
ing with different polarizations in fibers. The 
network architecture can be highly scalable; 
it only requires wavelength-multiplexed or 
-demultiplexed bipartite entangled states rather 
than multipartite entangled states. Resources 
scale up linearly, requiring 7 node chips linked 
by 7 channels and d photon sources on the 
server chip. The upper bounds of nodes (7) 
and capacity (d) are determined by the spec- 
tral and modal bandwidth of the whole 
system, respectively. 

The 200-mm wafer that contains at least 27 
copies of identical quantum networking chips 
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Fig. 3. Retrieving multidimensional entanglement in a multimode 
chip-fiber-chip system. (A) Flowchart of the QER algorithm consisting of two 
stages, CCU and QAB. The CCU is carried out to recover the mode mixing, and 
the QAB then retrieves maximal entanglement. O; denotes the measured 
optical intensity (normalized) at the ith port. C is the convergent criterion; when 
the cross-talk (1 - 0) to other channels is less than C, the CCU converges. 
(B) Implementing QER on the two reprogrammable photonic chips, linked by a 
FMF. Iteratively reconfiguring the V, or U; can automatically find the right or 
left singular vectors, respectively, of an unknown transmission matrix As. 

ow i + 1 (dashed boxes) successively maps the 
initial ith path-mode onto the corresponding supermode. The QAB and CCU 
processes are decomposed to a sequence of matrix operations on signal and 


Gradually reconfiguring the 


is shown in Fig. 2A. We demonstrate in small 
scale the first multichip multidimensional 
entanglement networks, between three node 
chips and a server chip, linked by three FMFs 
(Fig. 2B, circuit diagram). To implement the 
entanglement network architecture, hybrid multi- 
plexing devices are monolithically integrated. 
The networking chips allow wavelength- 
multiplexing and -demultiplexing of single 
photons; generation, manipulation, and mea- 
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surement of path-encoded multidimensional 
entangled states; as well as coherent conversion 
to hybrid polarization mode-encoded states 
in FMFs. For scalable implementations, the 
chips require a broad bandwidth of spectral 
and modal modes, large fabrication tolerance, 
and high uniformity across the wafer. Broad- 
band (a 3 dB bandwidth of 50 nm) biphoton 
correlations are generated by means of a 
spontaneous four-wave mixing (SFWM) pro- 
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idler photons: loss-balance A,;, phase compensation P,;, the identity /, and 
encoding-reversal R. (C) Simulated mutual information S of the transmitted 
multidimensional entangled states in four cases: without QER, with CCU, 
with CCU + R, and with the full QER. The dashed line indicates the ideal value for 
|@;): Smax = 4. (D) Experimental implementation of CCU. Normalized optical 
intensities for different modes were measured as the algorithm iteratively 
went on. Within 130 iterations, the CCU algorithm converged (when the 
variation of transmission was less than 5% for nearby iterations), and we 
interrupted the channel at step 130, which was online recovered again within 
70 iterations. (Insets) Measured cross-talk matrices at steps 3, 30, 75, 129, 
and 198, and the element (i, /) denotes the cross-talk ratio from ith mode 


cess in silicon waveguides (Fig. 2C). By co- 
herently pumping four sources, three pairs 
of four-dimensional path-encoded Bell state 
C= 1/25) _olk)|A) at paired wavelength 
{N13 Axo, Aus} are prepared, where |X) denotes 
a logical state in the kth path (29). Four ar- 
rays of coupled second-order microring reso- 
nators (MRRs; individually tunable) follow 
to demultiplex and remultiplex the correlated 
photons as {[A_3, Ao], Au, Avo], Aa, Ars]} 


3 of 6 


RESEARCH | RESEARCH ARTICLE 


A stability measurement 
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by the normalized mode cross-talk (01 IYO , throughout the server chip, 
chip-fiber interface, to the FMF, and last, back to the node chip, where Oj; is the 
optical intensity collected at the jth port on the node chip by injecting light at the 
ith port on the server chip. Data were collected every 20 s. During measurement, 
the FMFs (about 1.5 m long each) were fixed to a table. (B) Experimentally 
reconstructed density matrices for the distributed multidimensional entangled states 
between node chips A and B, A and C, and B and C. The density matrix before 
entanglement retrieval between chips A and C is shown for comparison. Column 
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heights indicate the absolute values |p|; colors indicate the phases Arg(p) mod 27. 
Fidelities (F) and mutual information (S) are shown in the plots. The ideal value for 
®;) iS Smax = 3.17. (€) Experimental estimation of the generalized Bell-CGLMP 
parameters (/3) and their quantum correlators (A,B,) between node chips. Dashed 
boxes indicate theoretical values. (D) Correlation matrices for multidimensional 
chip-fiber-chip QKD. Z indicates the computational basis, and F indicates the Fourier 
basis. The secure key rates in bits per coincidence are measured for the two- 
and three-dimensional entangled states. Color bars indicate the measured 
probabilities. Error bars in (C) and values in parentheses in (B) and (D) indicate 
+lo uncertainty, estimated from Poissonian photon statistics. 


(Fig. 2B). Measured highly overlapped flat-top 
spectra of the MRRs arrays are shown in Fig. 
2D (3 dB bandwidth of around 0.8 nm for each 
MRR); the wafer-scale test showed good fab- 
rication tolerance of the design (29). We fur- 
ther characterized the photon pair sources by 
measuring reversed Hong-Ou-Mandel quan- 
tum interference. The measured visibilities 
[with a mean of 0.967(32) for all 18 interference 
fringes (29), where the number in parentheses 
is +lo uncertainty] confirm the high spectral 
indistinguishability of single photons after 
MRR demultiplexing. 

The multipath-encoded states are inherently 
phase-stabilized on chip (30), whereas encod- 
ing d-dimensional states in a superposition 
of multiple eigenmodes in MMFs is preferred 
owing to relatively stable phases between 
internal modes. We devised a multidimen- 
sional chip-fiber interface that can coherently 
transform between on-chip path encoding and 
polarization-mode encoding in fiber. This in- 
terface consists of a mode multiplexer (37) and 
a two-dimensional multimode grating coupler 
(MMGC) (82). Designs are provided in (29), sec- 
tion 1. The coherent transformation between 
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on-chip multipath-encoded states and polariza- 
tion mode-encoded states in FMFs works as (29) 


|0)<>| TEy)>|LPue), |1)<>|TEor) |LPoix) 
|2)<>|TEoy)|LPoty) , |3)<>|TE1y)<>|LPny) 


where |7Eo) and |7E;) represent the logical 
states in the fundamental and first-order 
transverse electric (TE) modes of waveguides, 
|LPo1) and |LP,,) represent the logical states in 
the LP modes of FMFs, and w and y represent 
two orthogonal polarization states. On the 
node chip, the states are reversely transformed 
back to path encoding. Simulated and exper- 
imental LP-mode distributions at the output 
of FMF are shown in Fig. 2E [a full data set is 
provided in (29)]. The measured spectra from 
the single-mode input, to mode multiplexer, 
MMGC, and through FMF are shown in Fig. 
2F. It shows a 3 dB bandwidth of about 15 nm 
and a mode-dependent loss of about 2.9 dB. 
Higher-order mode multiplexer—for example, 
up to 11 modes (33)—and high-performance 
multimode chip-fiber edge couplers (34) have 
been realized with multimode silicon photon- 


ics (35, 36), and MMFs can easily support hun- 
dreds of LP modes—for example, 400 modes in 
a typical 50 um-diameter MMF (23)—therefore 
promising substantial scalability of quantum 
network’s dimension. 


Multidimensional entanglement retrieving 


Although MMFs show access to huge infor- 
mation capacity, lights that propagate in these 
complex media experience troublesome deg- 
radation that results in mode scrambling and 
entanglement degenerating (J9, 20). In our 
experiment and in most real-world systems, 
the transmission matrix of channels—including 
in this work the path-to-mode transformation 
and chip-fiber interface—is generally scrambled 
and nonunitary. This is caused by inevitable 
mode loss and cross-talk in MMFs because of 
fiber bending, twisting, or stretching, and mode 
mismatching at the interface. On the basis of 
Miller’s scheme (25, 26), we realized a QER meth- 
od that can actively retrieve multidimensional 
entanglement from any complex scattering 
(nonunitary) process in the reciprocal sys- 
tem. By all-optically implementing the QER, 
the network can self-train its controllable 
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chips (node and server) to retrieve multidi- 
mensional entanglement without any need 
of reconstructing the matrix of channels and 
matrix processing [discussion is available in 
(29)]. The QER consists of two stages: a clas- 
sical channel unscrambling (CCU) algorithm 
and a quantum amplitude balancing (QAB) 
strategy (Fig. 3A). The overall process is sum- 
marized in Fig. 3B. More details are provided 
in (29), section 4. 

In the first stage, the CCU algorithm is carried 
out to unscramble the strongly mixed modes 
in chip-fiber-chip channels and reconstruct the 
orthogonal supermodes. This process can be 
regarded as the singular value decomposition 
(SVD) of an unknown nonunitary transmis- 
sion matrix A = UXV*, where ¥ is a diagonal 
matrix with non-negative diagonal elements 
ox (singular values) arranged in descending 
order (69 = 6; = ... = Gg = 0), and Vand Uarea 
unitary matrix composed of the right-singular 
vector vu, and left-singular vector wz, respec- 
tively. The right and left singular vectors are dual 
invariant vectors of A : Av, = 6,t,, A’ M;, = 
o~0,, Where overlines denote complex con- 
jugate. Hence, the superposition, in the form 
of right- and left-singular vectors, of initial 
encoding bases constitutes the orthogonal in- 
variant input- and output-supermodes |w;,) and 
|w;,) of the channel, respectively, and singular 
values correspond to supermode-dependent 
losses. 

We performed SVD on the transmission 
matrix and determined the invariant super- 
modes through bilateral power iterations (we 
label signal photon as “s” and the idler photon 
as “i” in the following), using two triangular 
linear optical circuits (V, on the server chip and 
U, on the node chips). The dominant supermode 
with singular value og could be reconstructed 
first. On the server chip, the light beam with unit 
intensity was injected into the first port |0) of V, 
(initially set as identity), which can be expanded 
as the superposition of input-supermodes as 
19) =|0) server = So %el@e), Where dy is the 
complex coefficient. The forward transmis- 
sion of light through the FMF is equivalent 
to acting the transmission matrix .A, on the 
state |e) = As|@o) = Sop SMM) Then, 
on the node chip, tunable beamsplitters and 
phase-shifters in row 1 of U, were adjusted to 
minimize the output power {O3, Oz, Oi} 
jewel: se all power into the |0) sae 
mode, U" 11) <|0) noge (Superscript is the 
iteration step). Then, the beam was backwardly 
sent into the first mode of the node chip. The 
reversed state is |@,) = Aue 10) node & 
> o%KAK Pr) (the backward-transmission 
matrix is Ar ). We then adjusted V, to get the 
10) serven Physically repeating this all-optical 
process is equivalent to alternatively acting 
As and Ar on the vector. In the nth step, the 
added weight of Ath supermode is oe", which 
means that the nondominant components 
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are exponentially compressed as iteration goes. 
When converging at fth seer On it returns 
ae gominany supermode V”|0) server |Vo)s 

7 0) node—|tho) with the minimum loss. 
ae repeating the bilateral power iterations 
for different input ports |X) allowed the map- 
ping for all other supermodes with descending 
singular values (increasing loss). 

Maximal entanglement, however, cannot 
be retrieved only through CCU. The signal and 
idler photons of the state |) were transmitted 
through two separate FMFs. Mapping from path- 
encoded states onto supermodes, the transmis- 
sion matrices turn into a diagonal in the new 

basis: A, > 2,Ps, Aj > XP, and the transmitted 
state becomes ed Pa (Z3Ps ® LiPi)|@z) = 
12. ot 8) gl) ell +%) |2e) lie), where =, and 
; are matrices of singular values oO, and Oo; in 
descending order, and P; and P; are redundant- 
phase matrices from the nonuniqueness of 
SVD: P; = diag[e, eee] and Pi = 
diag [e’* , ce’, e', e's]. The state |) remains 
partially entangled, and the amplitudes need to 
be balanced, for which QAB was used. QAB first 
reverses the encoding basis of one photon (such 
as the idler) on the server chip by operating a 
permutation matrix R that exchanges |X) and 
|3 — k). That gives a new combination of two 
singular-value sets, and the state can be described 
as |¥1) =1/2)_ of) of) ,ef(@r+s-»)|k)|3 — hy. 

k 

The square of the factor ool), propor- 
tional to coincidence counts can be tailored to 
realize the ae entangled state |W) = 


1/2min{o,), 03 re}! (G:+45-«) ie) |3 — ie), by 


configuring pump a itu into the sources 
to alter the squeezing parameter or by slightly 
shifting the resonance of MRRs to adjust the 
spectral overlapping of photons (Fig. 3B, A,j). 
The value of min { 3 oy x} reaches maximum 
(minimum total loss) among all combinations 
of two singular-value sets for QAB (29). The 
phase factors can be simply compensated with 
phase shifters (Fig. 3B, Pj). 

We adopted quantum mutual informa- 
tion to characterize the entanglement, de- 
fined as S(A: B) = S4 + Sg - Sap, where Sac = 
-TrpagyLogop4g] is von Neumann entropy 
for the subsystem and Sz is von Neumann en- 
tropy for the whole system. The non-negative 
S value decreases with less entanglement. 
We chose 100 pairs of random transmission 
matrices {.A,, Aj} to validate the QER. Simu- 
lation results in different scenarios are shown in 
Fig. 3C. Implementing the full QER, the mutual 
information is recovered to the maximum 
value. The experimental results of unscram- 
bling mixed modes in the multidimensional 
chip-fiber-chip channel by means of CCU are 
reported in Fig. 3D. The four supermodes 
were successively unscrambled from the one 
with the largest singular value (minimum loss) 
to the one with the smallest value (maximum 


loss). The recovered modes possessed much 
lower cross-talk. To show its online recovery 
ability and high robustness, which are key 
for practical quantum networking, we inten- 
tionally interrupted the FMF by bending the 
fiber and misaligning fiber-chip coupling 
(Fig. 3D, red triangle with exclamation point), 
and the algorithm surmounted it rapidly. Trun- 
cating lossy modes under the communication- 
capable threshold in larger-dimensional channels 
can further speed up the algorithm and reduce 
the total loss of the state. For this work, we 
truncated the lossy fourth mode to increase 
the count rates and prototyped the retrieved 
multidimensional entanglement network. 


Entanglement distribution across 
multichip networks 


We measured the stability of the whole system 
by monitoring the cross-talk matrix over a long 
time (Fig. 4A). Any occurrence of state rotations, 
whether on chips or in FMFs or their interface, 
could be monitored with this measurement. 
We observed fluctuation below +2.5% within 
24 hours, indicating a well-stabilized multi- 
dimensional chip-fiber-chip quantum system. 
No recalibration or realignment were needed 
during all the measurements in this work. 
We then coherently distributed three pairs 
of three-dimensional entangled photons from 
the server chip to three node chips through 


three FMFs. We implemented the full QER ‘+ 


method to retrieve entanglement. To verify the 
presence of retrieved entanglement, we imple- 
mented quantum tomographical measurements 
using compressed sensing techniques (37) to 
reconstruct the density matrices (p) and cal- 
culate the bipartite mutual information (S), 
after distributions across the multichip net- 
work. The results for reconstructed density 
matrices are provided in Fig. 4B. Quantum 
state fidelity is defined as F = (®}|p|®;), 
where |®) is the ideal pure state, and the 
mutual information S is estimated from p. We 
obtained F = 0.884(10), S = 2.59(4) between 
node chips A and B, F = 0.912(7), S = 2.70(3) 
between node chips A and C, and F = 0.874(20), 
S = 2.60(10) between node chips B and C. As a 
comparison, in Fig. 4B, we show the density 
matrices before and after entanglement re- 
trieval for node chips A and C, demonstrating 
the success of QER. The degradation is mainly 
due to the imperfect QER caused by some im- 
provable defects of our devices (29). We tested 
a generalized Bell Collins-Gisin-Linden-Massar- 
Popescu (Bell-CGLMP) inequality (38) to certify 
the presence of multidimensional entangle- 
ment after distributions between all paired 
node chips in the network [(29), section 5]. 
The experimentally measured expected values 
of quantum correlators (A,B) ({a, b} € {1, 2}) 
and the measured Bell parameter /; are shown 
in Fig. 4C. We estimated J; of 2.31(9), 2.45(8), 
and 2.30(6) for the three pairs of the entangled 
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state, which confirm the experimental viola- 
tions of the classical bound by 3.440, 5.630, and 
5.000, respectively. The simultaneous viola- 
tions of inequalities for all linked chips there- 
fore confirm the success of multidimensional 
entanglement distributions in the multichip 
quantum network. 

Additionally, our multichip system has the 
potential to provide higher information capac- 
ity and greater noise resistance for QKD. In Fig. 
4D, we report the experimental correlation 
matrices for multidimensional entanglement- 
based QKD. The two chips in the network are 
measured in the mutually unbiased bases. We 
obtain higher bits per coincidence (bpc) of 
0.346(37) for three-dimensional entangled states 
even with a larger quantum bit error rate (QBER; 
defined as the infidelity) of 10.3%, compared 
with 0.301(33) bpc and 4.6% QBER for two- 
dimensional entangled states, respectively. 
Details of multidimensional entanglement- 
based QKD are provided in (29), section 6. 


Conclusions 


We have demonstrated a multichip multidimen- 
sional quantum entanglement network with 
retrievability using silicon-photonic hybrid multi- 
plexing technologies. The full spectrum of hybrid 
encoding and multiplexing devices have been 
monolithically integrated on chips, delivering 
one of most complicated integrated quantum 
photonic devices in terms of circuitry com- 
plexity and functionality. The scalabilities of 
networking architecture, integrated quantum 
devices, MMF channels, and entanglement re- 
trieving techniques have been demonstrated 
and verified. In future real-world long-distance 
quantum networks, environment noises that 
result in mode cross-talk and phase drift in 
MMF s could be real-time all-optically corrected 
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by using fast switching and detection of light 
in silicon (29). Our work points toward the 
practical implementations of large-scale, chip- 
based quantum entanglement networks for 
quantum information processing and com- 
munications (/, 2). Technical advances in en- 
tanglement retrieving and integrated-photonics 
quantum information processing could pro- 
vide a programmable way for operating quan- 
tum light in ultrahigh-dimensional fiber-based 
complex media, which could be amenable to 
fundamental and applied sciences (79, 20). 
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Correlating the charge-transfer gap to the maximum 
transition temperature in Bi,Sr2Ca,;Cu,02,+4+5 


Zechao Wang**+, Changwei Zou*+, Chengtian Lin*, Xiangyu Luo®, Hongtao Yan°, Chaohui Yin®, 
Yong Xu**7, Xingjiang Zhou’, Yayu Wang'**, Jing Zhu?2* 


As the number of CuO, layers, n, in each unit cell of a cuprate family increases, the maximum transition 
temperature (T. max) exhibits a universal bell-shaped curve with a peak at n = 3. The microscopic 
mechanism of this trend remains elusive. In this study, we used advanced electron microscopy to image 
the atomic structure of cuprates in the Bi2Sr2Ca,;Cu,02,+4+5 family with 1 < n < 9; the evolution of the 
charge-transfer gap size (A) with n can be measured simultaneously. We determined that the n 
dependence of A follows an inverted bell-shaped curve with the minimum A value at n = 3. The 
correlation between A, n, and Tmax may clarify the origin of superconductivity in cuprates. 


dentifying universal trends of the super- 
conducting transition temperature (T,) 
and its correlations to other physical param- 
eters may provide crucial clues for elucidat- 

ing the origin of superconductivity (7). For 
example, the isotope effect T,.°*1/ VM, where 
M is the isotopic mass, has inspired the phonon- 
mediated pairing picture in establishing the 
microscopic theory of conventional supercon- 
ductivity in metals and alloys (2, 3). Since the 
discovery of high-T, superconductivity in cop- 
per oxide materials (referred to as cuprates), 
there have been considerable efforts in under- 
standing what controls 7. There are two well- 
established trends about 7, in cuprates: One is 
the dome-shaped doping dependence of T, for a 
specific cuprate compound, where the maxi- 
mum 7) (Temax) 1s located at hole concentration 
p ~ 0.16 (1). The other is the variation of Tomax 
with the number of CuO, planes (7) per unit 
cell in a homologous series, which reaches the 
highest when 2 = 3 (4). Understanding the 
physical origin behind these empirical rules 
could lead to the eventual solution for the 
pairing-mechanism problem in cuprates (J, 4). 
Whereas the second trend remains univer- 
sal for many cuprate families (Fig. 1A), the first 
is apparently violated in the trilayer cuprate 
BigSrgCayCuzO 49,3 (Bi-2223), which has the high- 
est Tomax Of ~115 K in the BiySryCay;Cu,Oons4+5 
family (Bi-family) compounds. Recent scanning 
tunneling microscopy (STM) studies (5) have 
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shown that the superconducting gap of Bi-2223 
decreases monotonously with p in the over- 
doped regime. Unexpectedly, the 7, in this 
doping range exhibits a plateau rather than 
a continuous drop, most likely owing to the 
existence of two types of inequivalent CuO, 
planes (6, 7). By contrast, the Bi-2223 system 
provides an ideal platform for investigating 
the evolution of T.max With n and the physi- 
cal parameters that control Tomax. A promising 
starting point is the Mott-insulating “parent” 
state (p ~ 0) of cuprates, where the electronic 
structure can be described by a single param- 
eter referred to as the charge-transfer gap 
(CTG) A, the size of which varies from 1 to 3 eV (8). 
A previous STM work (9) in BiSrsCa,,;Cu,,Oo,448 
and Ca,,,;Cu,,O,,,2Clz with n = 1 and 2 suggests 
that Tomax is anticorrelated with the A, together 
with theoretical calculations (10-12). However, 
a measurement covering the whole homologous 
cuprate series has not yet been achieved because 
of two main obstacles. First, it is difficult to syn- 
thesize high-quality single crystals for n = 4 in 
any cuprate family, and it is even more difficult 
to reach the p ~ 0 limit. Second, most of the 
spectral information about cuprates cannot 
distinguish the distinct CuO, planes for n = 3. 

The development of advanced electron micros- 
copy enables one to achieve through-depth 
(defined in this paper as along the crystallo- 
graphic c axis) imaging and carry out high-spatial 
(subangstrom)-resolution electron energy-loss 
spectrometry (EELS) experiments. Thus, it 
allows a local probe of the intrinsic structural 
and electronic properties in confined unit cells 
(13). Here, we used scanning transmission 
electron microscopy (STEM) and EELS tech- 
niques to directly image the layer-by-layer lat- 
tice configuration and electronic structure of the 
Bi-family cuprates, covering an unprecedented 
layer structure range with 1 < n < 9. 


Imaging the atomic structure of the Bi-family 
compounds with 1 <n < 9. 


The evolution of Tmax With n for the Bi-family 
cuprates is indicated with a bold orange line in 


Fig. 1A, which displays the universal bell-eha jee 
trend (/4-19). Owing to the high spatial rL-2—— 
lution of the STEM technique, the layered 
atomic structure can be clearly visualized, as 
shown in Fig. 1B for an optimally doped Bi- 
2223 sample. Each Cu atom in the inner CuO, 
plane (IP) forms an in-plane CuO, plaquette, 
whereas in the outer CuO, planes (OP), the 
configuration is a CuO; pyramid with one apical 
oxygen (Fig. 1B, right). Some previous studies 
(20, 21) have suggested that the different envi- 
ronments of IP and OP may have a profound 
influence on their superconducting properties, 
but their layer-resolved electronic properties 
remain to be unveiled. 

Figure 1, C to K, displays a series of high- 
quality cross-sectional STEM images, in which 
BigSroCa,4Cu,Oon+4+3 With 1 < n < 9 are all 
identified. We obtained the data for n = 3 by 
carefully searching within a dozen Bi-2223 sam- 
ples, which include very scarce regions that have 
four or more CuO, planes per unit cell. The 
details about the search and discovery of the 
n > 3 phases are shown in fig. $5. Because of 
the quasi-two-dimensional nature of Bi-family 
cuprates, it has been shown recently that all 
the essential electronic properties are contained 
within a unit cell, BiO/SrO charge reservoir 
layers plus the CuO, layers (22). The method- 
ology used here opens a gate for studying 
multilayer cuprates. 


Measuring the charge-transfer gap of the 
Bi-family compounds. 


The electronic band structure of cuprates is 
well characterized by the in-plane Cu 3d,2_,2 
and doped O 2p,/2p, orbitals (23). Within the 
Zaanen-Sawatzky-Allen scheme (24), the un- 
doped cuprate is a charge-transfer-type Mott 
insulator, in which the lowest-energy excita- 
tion is from the O charge-transfer band (CTB) 
to the unoccupied upper Hubbard band (UHB) 
of the Cu sites, which is schematically illus- 
trated in Fig. 2A. Figure 2B shows the STM 
spectra in three insulating Bi-family cuprates 
with n = 1, 2, and 3, in which the A value be- 
tween the edges of CTB and UHB can be 
clearly extracted. The results here expand on 
those of previous studies (9) to n = 3 and 
reveal the same trend of a monotonic decrease 
of A from 1.5 eV in Bi-2201 to 1.0 eV in Bi-2212 
and 0.7 eV in Bi-2223. 

Upon hole doping, an additional feature 
emerges from the CTB that is known as the 
Zhang-Rice singlet (ZRS) band (Fig. 2C). This 
feature can be attributed to the hybridized 
state of a doped hole involving one Cu-3d and 
the four nearest O-2p orbitals (25). In this 
circumstance, the main features in the STM 
spectra are the spectral weight transfer from 
high energy to low energy and the gradual for- 
mation of the pseudogap (26); the A value 
becomes less well defined. Nevertheless, the 
energy difference between the centers of the ZRS 
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Fig. 1. The T, max and atomically resolved crystal structure of BizSr2Ca,.,;Cu,02,+4+3 cuprates with 1 < n < 9. (A) Relationship between T,.max and n for the 
homologous series of n-layered cuprates (14-19) including Tl-12(n-1)n, Hg-12(n-1)n, Cu-12(n-1)n, and Bi-22(n-1)n. The bold orange line indicates the Tomax aS a 
function of n in the Bi-family. The n = 9 phase has never been discovered, so the Tomax value was obtained with linear extrapolation (as indicated with the black 
dotted line) and represented by a hollow circle. (B) A zoomed-in area of the n = 3 sample (Bi-2223) with the schematic crystal structure shown at right. OP, outer 
CuOz plane; IP, inner CuOz plane. (C to K) Cross-sectional layered structure in the Bi-family cuprates with 1 < n < 9, where the c axis direction is along the vertical 
blue arrow. The yellow arrows indicate the positions of CuOz planes in the crystal (fig. $4). Scale bar, 5 A. All the data were taken at T = 300 K. 


Fig. 2. Probing A with STM and STEM-EELS 
techniques. (A) Schematic band structure of the 
undoped cuprates. LHB, lower Hubbard band; CTB, 
charge transfer band; UHB, upper Hubbard band. 
(B) The Mott-insulator type d//dV (differential 
conductance) spectra of undoped Bi-2201 (n = 1), 
Bi-2212 (n = 2), and Bi-2223 (n = 3) samples taken 
with STM. The A here characterizes the distance 
between the edges of CTB and UHB, as shown in (A). 
(C) Schematic band structure of hole-doped cup- 
rates. ZRS, Zhang-Rice singlet. The red and blue 
arrows indicate the EELS excitation process from 
core levels to the unoccupied UHB and ZRS, 
respectively (the principle is shown in fig. S6). 

(D) STEM-EELS of the O K edge from n = 1 to 3. 
The A here characterizes the distance between the 
centers of ZRS and UHB, as shown in (C). The 
Gaussian functions were used to simulate the 

ZRS (red dotted lines) and the UHB (purple, IP; 
green, OP). (Left) The STEM images show the real- 
space areas where the EELS are taken. All the 
STEM-EELS data were taken at T = 300 K. a.u., 
arbitrary units. 


and UHB is still a valid parameter that charac- 
terizes the charge-transfer energy (27), which 
varies little with doping (28, 29). The state-of- 
the-art STEM-EELS is an ideal technique for 
measuring such peak-to-peak CTG, where the 
spectral peaks shown in Fig. 2D indicate the 
large probability of exciting core-level electrons 
to the unoccupied ZRS and UHB in Fig. 2C (27), 
in analogy to the x-ray absorption spectroscopy 
(XAS) at the O K edge (excitation energy of an 
O Is core electron to an empty state). In par- 
ticular, the ZRS peak centered around 528.5 eV 
(Fig. 2D, red dashed line) corresponds well with 
the measurements of the bulk-sensitive XAS 
technique (27). The ZRS peak is robust in cup- 
rates against doping, temperature, and materials. 

Similar to that of STM, the A obtained with 
EELS is mainly controlled by the unoccupied 
UHB, which displays a red-shift trend from 
n = 1 to 3, as revealed in Fig. 2D. We extracted 
the A size from a reliable fitting procedure as 
described in (30), which renders A = 2.5 eV for 
Bi-2201 and A = 2.0 eV for Bi-2212, respectively. 
Owing to the high atomic-plane resolution, we 
distinguished the CTG corresponding to the OP 
(Aop = 1.7 eV) and IP (Ap = 1.5 eV) in trilayer 
Bi-2223. The spectral weight ratio between the 
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ZRS and UHB is smaller in the IP, which is sug- 
gestive of the lower hole concentration (27). 


Correlating observables with the number of 
CuO; layers. 


To investigate how the A size evolves with the 
number of CuO, layers per unit cell, we have 
further determined the STEM-EELS of the O K 
edge with 1 < n < 9 (Fig. 3, A and B). The 
definition and description of different CuO. 
planes where we took the STEM-EELS meas- 
urements are shown in fig. S7. We repeated 
the measurements 10 times on different areas 
with the same 7 to improve statistical robust- 
ness (full dataset is provided in figs. S8 to S17) 
(30). For clarity, we only show the averaged 
spectra for the OP layer; those for the IP layers 
are shown in fig. S19. The overall trend is 
shown in Fig. 3, C and D, which clearly dem- 
onstrate that the nm dependence of A is rep- 
resented by an inverted bell-shaped curve with 
the minimum A = 1.8 eV at 2 = 3. There is an 
apparent anticorrelation between the n de- 
pendence of A and T¢max- For all the phases in 
which n = 3, the A value of the IP is smaller 
than the A value of OP, but exhibits the same 
dependence on 7 (Fig. 3D). 
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In the scenario of single-band Hubbard model, 
we may define an effective superexchange energy 
Jose ~ 4 of¢/A, Where tere is the effective hop- 
ping term that is proportional to bral A , and ta 
is the hopping integral between neighboring O 
2p and Cu 3d orbitals (8). Because the Jeg 
between local Cu moments has been proposed 
to be responsible for the spin singlet pairing 
(31), Jeg ~ 1/A® thus correlates the CTG and the 
Cooper pairing strength. For 7 = 1 and 2, only 
the OP is present, and the increase of Jop is 
consistent with that derived through STM (9) 
and inelastic photon scattering (32). For the 
whole range with 1 < n < 9, Fig. 3E shows that 
the n dependence of Jeg for the OP layer 
exhibits a bell shape highly analogous to that 
Of Tomax: It reaches a peak at n = 3 and de- 
creases for 4 < n < 9. The close correlation 
between Jog and Tomax indicates that the A, 
and thus the underlying strong correlation 
effects, determine Tomax, at least in the Bi-family 
cuprates with 1 < n < 9. 

The combined STEM-EELS and STM results 
here touch on the central issue regarding the 
bell-shaped evolution of Tmax with 7, which is 
universal in all cuprate families. From the 
theoretical point of view, a previous study (4) 
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has proposed that the enigmatic balance be- 
tween Josephson tunneling and competing 
electronic orders is the underlying physical 
mechanism. Our results reveal that it is the 
CTG, which displays an inverted bell-shaped 


A 


trend compared with the T.max that plays a 
deterministic role in this empirical rule. This 
conclusion is supported by the consistency of 
A characterized by energy scales between band 
centers (EELS and XAS) and edges (STM). We 
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Fig. 3. The evolution of A with n in Bi-family cuprates. (A) STEM-EELS with a 
higher energy range of the O K edge with 1 < n < 9. For each curve, we averaged 
the data measured on the OP layer of 10 different areas with the same nr (full 
datasets are provided in figs. S8 to S17). (B) Zoomed-in STEM-EELS of (A). The 
Gaussian functions were used to determine the position of ZRS and UHB (fig. 
S18). (€ and D) The statistical results of A for the OP and IP with 1 <n < 9, which 
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summarize the CTG sizes taken with different 
experimental probes in Fig. 4. The difference 
between STM and EELS shows a rigid shift of 
~1 eV, which is caused by the finite bandwidths 
and provides useful information about the 
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was determined with 10 measurements taken under the same experimental 
conditions. All the data were taken at T = 300 K. Each dot and error bar in (D) 
are the calculated average and standard deviation of the result in (C) for the 
same n. (E) The effective superexchange Jer, ~ 1/A° as a function of n, assuming 
the same ter for the OP and IP of different n in a homologous series. a.u., 
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Fig. 4. The correlation between A, n, and Temax- (A and B) Summarized plots of (A) A versus n and (B) A versus Tomax for the Bi-family cuprates. Our STM and 
EELS (on the OP) data, as well as previous XAS data, are shown together. The STM results are generally ~1 eV smaller than the EELS and XAS values, owing to the 
finite bandwidth or hopping integral t. The wide gray-red and blue-green strips are guides to the eye. See table S1 for more details. 
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effective hopping integral in the Bi-family 
cuprates. 


Discussion and outlook 


Our study shows that the CuO, planes with 
different apical environments have very dif- 
ferent A, which is seen not only in the evo- 
lution with 2 but also in that the IP always 
has a smaller A size than the OP of the same 
compound. The Jeg of IP is thus larger than 
that of OP, as shown in Fig. 3E, which is con- 
sistent with previous studies of the pairing 
strength (2/, 33). Therefore, the ultimate reason 
why A, and hence Tomax, Varies substantially for 
different n in a homologous series is most likely 
related to the orbitals out of the CuO, plane, 
indicating an interplay between the under- 
lying electronic structure and orbital param- 
eter symmetry in cuprates (4, 34, 35). 
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Recruited macrophages elicit atrial fibrillation 
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Atrial fibrillation disrupts contraction of the atria, leading to stroke and heart failure. We deciphered how 
immune and stromal cells contribute to atrial fibrillation. Single-cell transcriptomes from human atria 
documented inflammatory monocyte and SPP1* macrophage expansion in atrial fibrillation. Combining 
hypertension, obesity, and mitral valve regurgitation (HOMER) in mice elicited enlarged, fibrosed, and 
fibrillation-prone atria. Single-cell transcriptomes from HOMER mouse atria recapitulated cell 
composition and transcriptome changes observed in patients. Inhibiting monocyte migration reduced 
arrhythmia in Ccr2’- HOMER mice. Cell-cell interaction analysis identified SPP1 as a pleiotropic signal 
that promotes atrial fibrillation through cross-talk with local immune and stromal cells. Deleting Spp1 
reduced atrial fibrillation in HOMER mice. These results identify SPP1* macrophages as targets for 


immunotherapy in atrial fibrillation. 


n healthy atria, excitations arising in the 
sinus node coordinate atrial contractions 
that help fill the ventricles in diastole. 
Cardiomyocytes constitute 30% of atrial 
cells (7). The remaining 70% of atrial 
cells are fibroblasts, endothelial cells, and 
resident immune cells. Together, these pro- 
vide cardiomyocytes with housekeeping 
functions, including matrix scaffold, blood 
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supply, waste removal, and immunosurveil- 
lance (2, 3). Atrial fibrillation (AFib) is the 
most common arrhythmia (4). During AFib, 
the lack of coordinated atrial excitation and 
contraction slows blood flow into the ventri- 
cles. This sluggishness can diminish cardiac 
performance and give rise to atrial blood clots 
that cause thromboembolic stroke. 

Inflammation may contribute to AFib through 
electrical remodeling (5-7), during which al- 
tered ion handling perturbs the action po- 
tential (8). Fibrosis, another consequence of 
inflammation, induces heterogeneous atrial 
depolarization, which provides an opportu- 
nity for electrical conduction reentry and sub- 
sequent AFib (8, 9). To examine how immune 
and stromal cells contribute to AFib, we per- 
formed single-cell RNA sequencing (scRNA- 
seq) of freshly isolated normal and diseased 
human atrial tissue. 


Macrophages expand in human atrial disease 


We collected left atrial tissue from five control 
participants and seven patients with per- 
sistent AFib undergoing heart surgery (table 
S1) and performed scRNA-seq on the 10X 
Genomics Chromium platform (Fig. 1A and 
fig. S1A). A total of 41,609 viable, single cells 
from both patients with AFib and controls 
were partitioned into clusters (fig. SIB), which 
we annotated using known lineage markers 
(fig. SIC). We identified six major noncardio- 
myocyte cell types, which included (in order 
of decreasing frequency) lymphocytes, mono- 
nuclear phagocytes and dendritic cells (MP/DCs), 
endothelial cells, fibroblasts, mural cells, and 
neutrophils (Fig. 1, B and C; fig. S2; and table $2). 

In diseased atria, the MP/DC cluster ex- 
panded twofold, whereas endothelia and 
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mural cells decreased in frequency. The 7 ee 
cell classes remained stable (Fig. 1C and tL... 
$2). Because the MP/DC cluster increased 
considerably, we scrutinized the transcrip- 
tomes of these cells more closely. We iden- 
tified genes that were differentially expressed 
in MP/DCs from AFib and control atria with a 
pseudo-bulk approach that aggregated cells 
from the same individual to determine differ- 
ential gene expression between the two patient 
groups (10). We then performed gene set 
enrichment analysis (GSEA) (77) on the result- 
ing list of differentially expressed genes to 
define the pathways and functional gene groups 
that were altered in MP/DCs from patients with 
AFib. In this disease setting, MP/DCs up- 
regulated gene sets related to the inflamma- 
tory response, interferon-y (IFN-y) signaling, 
monocyte recruitment, and cholesterol metab- 
olism (Fig. 1D and data S1). Thus, atrial MP/ 
DCs undergo substantial immunometabolic 
remodeling during their expansion in patients 
with AFib. 

To gain a broader understanding of the 
transcriptional programs that are activated 
or repressed in MP/DCs, we used weighted 
correlation network analysis to study groups 
of correlated genes that were differentially ex- ‘ 
pressed between patients and controls (12). 
We identified 24 modules of correlated genes 
across all participants, three of which were sig- 
nificantly correlated with disease state. Module + 
1 was up-regulated in AFib and contained in- 
flammatory and profibrotic genes such as 
CCR2, IL10, ITGA9, MMP9, SPP1, TIMP2, and 
VIM (Fig. 1E and data S2) (13-15). Overrepre- 
sentation analysis confirmed that inflamma- 
tory and extracellular matrix-related gene 
sets characterized module 1 (Fig. 1F and data 
$2). Module 1 genes SPP7 and CCR2 and the 
cell surface markers TREM2 and CD9 were 
among the top up-regulated genesin MP/DCs - 
from patients with AFib (Fig. 1G). SPPI en- ‘ 
codes the matricellular protein osteopontin, 
which participates in bone remodeling and 
fibrosis by promoting cell survival, adhesion, . 
and migration as well as inflammatory cell 
activation (13). The chemokine receptor CCR2 
mediates monocyte egress from the bone mar- 
row (16) and is essential for monocyte re- 
cruitment (74). Gene modules 2 and 3, whose 
expression was down-regulated in MP/DCs 
from patients with AFib, included genes in- 
volved in protecting against cardiac disease 
[SIRT2 (17)], cardiac resident macrophage pro- 
liferation [KLF4 (18)], and epigenetic con- 
straint of cytokine production [HDAC2 (19) 
and ODCI (20)] (Fig. 1E and data S2). These 
data imply that the atria of patients with AFib 
have elevated numbers of myeloid cells that 
assume a more inflammatory phenotype, impli- 
cating macrophages in the tissue remodeling 
that disrupts coordinated excitation of atrial 
cardiomyocytes. Other leukocytes and stromal 
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Fig. 1. Single-cell atlas of human atrial disease. (A) scRNA-seq was 
performed in five controls and seven patients with AFib. (B) UMAP delineates six 
major cardiac cell types. (C) Distribution of cell populations. Color code 
corresponds to (B). (D) Gene ontology biological process (GOBP) gene sets 
(48, 49) up- or down-regulated in AFib from GSEA of MP/DC cluster. Circle size 
denotes number of enriched genes. ECM, extracellular matrix. (E) Gene 
expression in MP/DC cluster represented by z-scores of the log-transformed 
normalized counts. Rows, samples grouped by disease state; columns, genes in 
the three gene modules significantly associated with disease state. Gene 
coexpression modules and exemplary genes are labeled. (F) Select MSigDB 

C2 gene sets (50) overrepresented in module 1 described in (E). GM-CSF, 
granulocyte-macrophage colony-stimulating factor; HGF, hepatocyte growth factor; 
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Macrophage cluster 


UMAP1 


IL-4, interleukin-4; KEGG, Kyoto Encyclopedia of Genes and Genomes. (G) Genes in 
MP/DC cluster ranked according to logs-fold changes [logo(FC)] between seven 
patients with AFib and five controls. (H) Subclustering of 10,555 MP/DCs from 
five controls and seven patients with AFib. (lI) Difference in abundance of cells 
belonging to MP/DC subclusters between seven patients with AFib and five control 
patients. Pie size indicates log2(FC); color (blue, high; red, low) indicates the P value 
(two-tailed Student's t test). **P < 0.01. (J) Fraction of SPPI* macrophages in 
controls and patients with AFib. Bar graph shows mean + SEM, n = 5 to 6 per 
group, two-tailed Student's t test. (K) Gene expression levels (represented by log- 
transformed normalized counts) across macrophage clusters; Kruskal-Wallis 

(P < 2.2 x 10") followed by two-tailed Wilcoxon rank sum test. ****Adjusted 

P =2x 10° (L) SPPI expression in MP/DCs from controls and patients with AFib. 
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Fig. 2. Single-cell atlas of mouse atria. (A) HOMER mice are exposed to experiments, two-tailed Mann-Whitney test. (D) AFib inducibility and burden. 
mitral valve regurgitation (MR), high-fat diet (HFD), and angiotensin 2 (Ang2). n = 8 to 12 per group from two independent experiments. (Left) Two-sided 
(B) Doppler of MR. (C) (Left) MRI of left atrium (LA). Scale bars, 3 mm. Fisher's exact test. (Right) Two-tailed Mann-Whitney test. Other parameters 
(Right) LA end-systolic volume, n = 6 to 10 per group from two independent are shown in table S7. SR, sinus rhythm. (E) scRNA-seq was performed in four 
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sham and four HOMER mice. (F) (Left) UMAP. (Right) Cell population distributions. 
(G) Log.(FC) of cell abundance in mouse and human atria. (H) Flow cytometry of 
sham and HOMER atria. n = 21 per group from five independent experiments, two- 
tailed Student's t test. APC/Cy7, allophycocyanin/cyanine 7; 
thrin/cyanine 7. (I) Scatterplot showing C5 GOBP GSEA species concordance. GSEA 
is performed on lists of genes ranked according to differentia 
disease state. -logj(FDR) is plotted for mouse (x axis) and human (y axis). -logi9 
(FDR) are negative if the gene set is down-regulated. Yellow indicates high and blue 
indicates low gene set density. (J) Genes in MP/DC cluster ranked according to log> 
(FC) between four HOMER and four sham mice. The y axis was broken at —5 for 


cells also altered their gene expression (fig. S3 
and data S3 and S4). Fibroblasts, for example, 
responded by up-regulating inflammatory path- 
ways and matrix production (fig. S3C). In endo- 
thelial and mural cells, we observed up-regulation 
of genes indicative of ischemia and vascular 
remodeling (fig. S4). 

Depending on their origin and microenvi- 
ronment, macrophages and their subsets take 
on phenotypes that enable a range of diverging 
and even opposing functions (3, 21). To identify 
cell types that contribute to atrial remodeling, 
we examined atrial myeloid cells in detail. We 
reclustered MP/DCs, choosing a resolution that 
maximized the number of differentially expressed 
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genes between subclusters (fig. S5, A and B). 
Reclustering separated three monocyte sub- 
sets, an established number in human blood 
(2D), and a fourth inflammatory monocyte clus- 
ter whose expression profile (fig. S5C and data 
85) and uniform manifold approximation and 
projection (UMAP) position adjacent to macro- 
phages (Fig. 1H) suggested an intermediate 
phenotype during monocyte-to-macrophage 
differentiation. At this resolution, we could 
distinguish five macrophage clusters (Fig. 1H, 
figs. S5C and S6, and data S6) and two DC 
clusters (Fig. 1H and fig. S5C). The inflammatory 
mono(b) cluster and the SPPI* mac(c) cluster 


Student's t test. non-CM, noncardiomyocyte. (P’ 
phages in sham and HOMER mice. n = 6 to 7 per group from two independent 


displaying the top down-regulated genes. (K) Subclustering of MP/DCs from 
four sham and four HOMER mice. APC, antigen- 


presenting cell. (L) Atrial Spp1* 
q.n = 4 per group, two-tailed Student's t test. (M) Gene 


expression levels (represented by log-transformed normalized counts) across 
macrophage clusters. (N) Spp] expression in MP/DCs from sham and HOMER mice. 
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Flow cytometry of atrial macro- 
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diseased atria (Fig. 11). The frequency of SPPI* 
atrial cells increased in patients with AFib 
when compared with non-AFib controls (Fig. 1). 
The expanding mac(c) cluster was the dominant 
SPPI source in atria of patients with AFib (Fig. 1, 
Kand L); other immune and stromal cells did 
not express SPP1 at high levels (fig. S7). TREM2 
and CD9, which encode proteins that can be 


distinguished the SPPI* mac(c) cluster from 
other cells (Fig. 1K and fig. $7). RNA in situ 
hybridization confirmed that macrophages 
express SPPI in atria of patients with AFib 
(fig. S8, A and B). Once secreted by macrophages, 


expanded 2.8- and 3.4-fold, respectively, in 


SPP1 associates with extracellular matrix (73). 
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(D) Flow cytometric quantification of TREM2* CD9* left atrial macrophages in C57BL/6 and Ccr2’~ HOMER mice. n = 7 to 9 per group from two independent 

experiments, two-tailed Student's t test. (E) AFib inducibility and burden in C57BL/6 and Ccr2’~ HOMER mice by means of electrophysiological (EP) study. n = 8 
to 9 per group from two independent experiments. (Left) Two-sided Fisher's exact test. (Right) Two-tailed Mann-Whitney test. Other EP parameters are shown in 
table S8. All bar graph data are mean + SEM with individual values for data distribution. 
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detected by cell surface staining (22), further 
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Fig. 4. Macrophage-derived SPP1 promotes atrial disease. (A) EP study of 
AFib inducibility and burden in C57BL/6 BMT HOMER and SppI-’~ BMT HOMER 
mice. n = 13 per group from three independent experiments. (Left) Two-sided 
Fisher's exact test. (Right) Two-tailed Mann-Whitney test. Other EP parameters 
are shown in table S9. (B) Heatmap of SPP1-receptor interactions between SPP1 
* macrophages [mac(e) and mac(g)] and other non-CMs in four HOMER mice. 
SPP1 and cognate receptors are shown on the x axis. Cell populations that 
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express Sppl and the receptor are shown on the y axis and are color-coded 
according to the major cardiac cell types identified in Fig. 2F. Color of scale bar 
(red, high; blue, low) indicates average Spp1 and receptor expression levels in 
their respective interacting subpopulations (represented by mean normalized 
counts) if the enrichment of an interacting SPP1-receptor pair in the given 
interacting subpopulations was statistically significant (P < 0.05). Nonsignificant 
interactions were assigned a value of 0. (C) Representative immunofluorescent 
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staining of macrophages (CD68, green), collagen deposition (COL1, magenta), 
and nuclei [4',6-diamidino-2-phenylindole (DAPI), blue] and histological staining 
of fibrosis (Masson's trichrome staining) in left atrial tissue from C57BL/6 BMT 
HOMER and SppI’” BMT HOMER mice. Scale bars, 50 um. (D) Quantification 

of CD68", COLI", and fibrotic area in left atrial tissue from C57BL/6 BMT HOMER 
and Sppl-’~ BMT HOMER mice. Bar graphs show percentage of positive 


Accordingly, immunofluorescence microscopy 
identified expanding colocalization of SPP1 
protein with collagen in patients with AFib 
(fig. S8, C and D). 

To explore atrial macrophage expansion in a 
larger cohort, we used immunofluorescence 
histology on left atrial appendages of 108 
patients with AFib and/or mitral regurgita- 
tion and 41 controls in sinus rhythm (table $3). 
Atrial macrophages and their monocyte-derived 
CCR2* subset expanded most in patients with 
AFib and mitral regurgitation (fig. S8, E and 
F). In patients who had mitral regurgitation 
without AFib or AFib without mitral regur- 
gitation, macrophages expanded to a lesser 
degree (fig. S8, E and F). In patients who had 
AFib without mitral regurgitation, AFib chroni- 
city was less pronounced (table S3), which may 
explain why we observed fewer atrial macro- 
phages in this cohort. 


A mouse model of atrial disease 


To dissect immune pathways involved in human 
AFib mechanistically, we used a mouse model 
that combined hypertension, obesity, and mitral 
valve regurgitation (HOMER) (Fig. 2A). Snaring 
the mitral valve to the ventricular wall produced 
mitral valve regurgitation (Fig. 2B). Mice 
consumed an obesogenic diet for 4 months, 
mirroring the risk that obesity engenders for 
AFib (23). Hypertension, also a risk factor for 
AFib, was induced with angiotensin 2 (fig. S9). 
The HOMER model elicited left atrial enlarge- 
ment (Fig. 2C) commonly found in patients 
with AFib (24). HOMER mice developed in- 
creased AFib inducibility and burden compared 
with those of sham-operated normotensive 
control mice that consumed standard chow 
(hereafter referred to as controls) (Fig. 2D and 
fig. S10). Among individual HOMER compo- 
nents, obesity elicited the highest AFib indu- 
cibility, whereas mitral valve regurgitation 
enlarged atria the most (fig. S11). Spontaneous 
AFib episodes, which were monitored with 
electrocardiogram telemeters, were absent in 
HOMER mice. In HOMER hearts, left ven- 
tricles hypertrophied and dilated mildly, and 
left ventricular systolic function was slightly 
reduced (fig. S12A and table S4). Blood leu- 
kocyte counts, including Ly6chi monocytes, 
rose in HOMER mice (fig. S12, B and C). 
Histological analysis revealed more numerous 
macrophages, fibrosis, and collagen deposi- 
tion in HOMER atria compared with those 
in controls (fig. S12, D and E), results that align 
with the fibrosis observed in patients with 
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AFib (25) and the effects of hypertension and 
metabolic stress on the ventricles (15, 26, 27). 
Hypertension and mitral valve regurgitation 
most clearly increased atrial macrophage num- 
bers and fibrosis (fig. S13). 

Replicating our human scRNA-seq pipeline 
by using left atrial tissues dissected from four 
HOMER mice and four control animals (Fig. 2E 
and fig. S14A), we successfully obtained 40,112 
viable single cells, among which we identified 
the same cell types as in human atria (Fig. 2F; 
fig. S14, B and C; and table S5). The cellular 
composition changes in diseased atria correl- 
ated strongly between both species [Pearson’s 
correlation coefficient (7) = 0.87] (Fig. 2G). We 
observed a twofold expansion of MP/DCs in 
HOMER mice similar to that in the human 
disease setting (Fig. 2F and table S5). Flow 
cytometry confirmed the HOMER-induced 
changes in the abundance of macrophages 
(Fig. 2H and fig. S15, A to C). Cardiac macro- 
phage subsets can be distinguished by expres- 
sion of CCR2 and major histocompatibility 
complex II (MHCID (28, 29). In atria of HOMER 
mice, the number of CCR2* macrophages and 
monocytes rose (fig. S15, D and E). 

We next examined the gene expression 
changes that occurred in MP/DCs of HOMER 
mice in relation to our human data. As before, 
we first performed GSEA on differentially ex- 
pressed genes from a pseudo-bulk analysis 
comparing MP/DCs isolated from atria of 
HOMER or control mice. We then plotted the 
significance and direction of change for all 
Molecular Signatures Database (MSigDB) C5 
(Gene Ontology) gene sets against the corres- 
ponding values in humans and found signifi- 
cant concordance between the two GSEA data 
sets [Spearman’s rank correlation coefficient 
(p) = 0.26, P< 2 x 10°*°] (Fig. 21 and data $7), 
indicating that the HOMER model recapitulated 
the pathway-level gene expression changes ob- 
served in patients. As in humans, SppI was 
among the top up-regulated genes in MP/DCs 
of HOMER atria (Fig. 2J). We then reclustered 
MP/DCs to obtain a higher-resolution view of 
these cells, choosing a resolution that sepa- 
rated three monocyte clusters [a number pre- 
viously described in mouse blood (30)], nine 
macrophage clusters, three DC clusters, two 
antigen-presenting cell clusters, and one SiglecF* 
cell cluster (Fig. 2K). SppI™ cell frequencies in- 
creased in HOMER atria (Fig. 2L), a rise driven 
by the macrophage subpopulations mac(e) and 
mac(g) and not by other immune or stromal 
cells (Fig. 2, M and N, and fig. S16, A and B). In 
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staining per field of view (FOV); n = 5 per group from two independent experiments, 
two-tailed Student's t test. (E) Scatterplot showing concordance between C5 
GOBP GSEA results in human and mouse fibroblasts in AFib and HOMER (as in 
Fig. 21). (Inset) Detailed GSEA results for gene set “GOBP collagen fibril organization” 
in human and mouse fibroblast clusters. NES, normalized enrichment score. 

All bar graph data are mean + SEM with individual values for data distribution. 


close concordance with our human findings, 
SppI* mac(e) and mac(g) clusters coexpressed 
Trem2 and Cd9 (Fig. 2M) and expanded in 
HOMER mice (Fig. 2, O and P, and fig. S16, C 
and D). Comparing MP/DC clusters across 
the two species (using the top 100 genes that 
distinguished each subcluster), we found that 
each human MP/DC cluster significantly matched 
at least one mouse cluster and that the human 
SPPI* cluster mac(c) was similar to the mouse 
Spp1* cluster mac(e) (fig. S17). This strong inter- 
species correlation is likely caused by similar risk 
factors in human patients and HOMER mice. : 

We next assayed contributions of individual 
HOMER components. Cer2 was expressed high- 
est in HOMER atria but also increased in 
hypertensive mice (fig. SI8A). Atrial expression 
of Csf1, chemokines, adhesion molecules, inflam- 
matory markers, and matrix increased the 
most in HOMER mice (fig. $18, B to F). Spp1 
expression increased in HOMER mice but 
not after exposure to single components (fig. 
S18G). Thus, HOMER components have addi- 
tive (for example, Ccr2 expression) and some 
synergistic effects (for example, Sppi expres- 
sion). Two of three HOMER components eli- 
cited intermediate atrial remodeling and Spp1 
expression (fig. $19). Although combining all 
HOMER components reflected the risk factors 
present in our patient cohort most faithfully, 
further studies of single conditions (for exam- 
ple, metabolic consequences of high-fat diet 
on conducting cells) are warranted. 

We then contrasted left atrial gene expres- 
sion changes to the left ventricular myocardium. 
Unlike the atria, we observed no significant 
expansion of ventricular leukocyte frequencies 
in HOMER mice (fig. S20, A to C, and table 
S6). The decline of fibroblast frequencies that 
occurred in HOMER atria was absent in the 
ventricles (fig. S20, A to C, and table S6). 
Compared with ventricular cells, more genes 
changed in atrial myeloid cells and fibroblasts. 
For example, there were 1982 atrial but only 
19 ventricular differentially expressed genes 
[with false discovery rate (FDR) < 0.05] in MP/ 
DCs (with nine differentially regulated genes 
shared between the atria and ventricles) (fig. 
$20, D and E, and data S8). This pattern was 
inversed for endothelial cells, which had more 
differentially expressed genes in the ventricles 
than in the atria (fig. S20, D and E, and data 
S8). A principal components analysis revealed 
that in controls, atrial MP/DCs were located at 
a large distance from ventricular MP/DCs. The 
distance between atrial and ventricular MP/DCs 
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was less pronounced in HOMER mice (fig. S20F). 
We hypothesize that HOMER-induced activation 
of inflammatory programs equalizes some of 
the differences normally imprinted by the atrial 
versus the ventricular environment. That the 
HOMER model affects left ventricular leuko- 
cytes to a lesser degree than in the left atria 
corresponds with the more severe atrial than 
ventricular dilation observed with magnetic 
resonance imaging (MRI) (Fig. 2C and fig. S12A). 
The ventricular data reveal that hypertension, 
high-fat diet, and mitral valve disease affect not 
only atria but the entire heart, a notion reflected 
by the clinical association of AFib with heart 
failure (31). 

The generation of the HOMER mouse and 
its scRNA-seq validation against patients with 
AFib provided us with a tool to assess whether 
immune cell expansion merely associates 
with—or alternatively, contributes to—AFib. 
We used the HOMER mouse for two follow-up 
studies that tested causal implications for key 
observations in patients with AFib: (i) expan- 
sion of inflammatory atrial macrophages and 
Gi) increased atrial SPP1 expression by those 
macrophages. 


CCR2-dependent macrophage recruitment 
promotes AFib 


Macrophage ontogeny often dictates function 
(32). For example, steady-state resident macro- 
phages in the heart and brain are distinctly 
noninflammatory (33), whereas recruited infarct 
or stroke macrophages are inflammatory and 
damage tissues (34). We consequently sought 
to investigate macrophage origins in normal 
and diseased atria. Because atrial macrophages 
expand in diseased atria of HOMER mice (Fig. 
2, F and H, and table S5) and enrich for the 
chemokine receptor CCR2 (fig. S15D), which 
recruited macrophages express (34), we com- 
bined the HOMER procedure with genetic fate 
mapping in Cx3crI"®’*4i9"* mice (fig. S21A). 
Although atrial macrophages were more likely to 
be derived from circulating monocytes than 
ventricular macrophages, most atrial macro- 
phages were not sourced from monocytes in 
steady state (fig. S21B). In HOMER mice, recruit- 
ment expanded so that 67% of atrial macro- 
phages derived from monocytes, a significant 
increase from the steady state (36%) (Fig. 3A). 
In line with increased atrial expression of Csf1 
(fig. SI8B) and up-regulation of the “Leukocyte 
proliferation” gene set by MP/DCs (fig. S21C), 
macrophage proliferation increased in HOMER 
atria (fig S21, D and E). Recruited macrophages 
in atria of HOMER mice were the primary 
source of Spp1, whereas locally sourced macro- 
phages did not express this gene at high levels 
(Fig. 3B). 

We used the HOMER procedure in Cerg”!~ 
mice to test the relevance of expanded macro- 
phage recruitment for AFib. Blocking monocyte 
migration in Ccr2/~ HOMER mice decreased 
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blood monocyte, atrial Ly6c™ monocyte, and 
macrophage expansion when compared with 
C57BL/6 HOMER mice, whereas other cell den- 
sities and blood pressure were unaffected 
(Fig. 3C and fig. S22, A to D). TREM2* CD9* 
macrophages were also lower in Cera!" HOMER 
mice (Fig. 3D). Cardiac MRI revealed smaller 
left atrial volumes in Cer2~/- HOMER mice 
(fig. S22E). Ccr2-/~ HOMER mice exhibited 
lower AFib inducibility and AFib burden than 
those of C57BL/6 HOMER mice (Fig. 3E). 
Thus, macrophage recruitment expands during 
atrial remodeling, recruited macrophages highly 
express Spp/, and inhibiting macrophage re- 
cruitment reduces AFib. 

To test whether CCR2 inhibition could alle- 
viate established disease, we treated mice with 
the CCR2 antagonist CCL2-Fc (35) for 4 weeks 
after the last HOMER treatment component 
(fig. S23A). CCR2 inhibition reduced fibrosis in 
atria of HOMER mice (fig. S23, B and C). AFib 
inducibility and burden were lower in the 
treatment group, although this effect was not 
statistically significant (fig. S23D). 


Macrophage-derived SPP1 is a pleiotropic 
AFib catalyst 


Given that Spp/ is among the most up-regulated 
macrophage genes in AFib, we examined the 
causal role of this matricellular protein. We 
transplanted bone marrow from Sppr/ ~ mice 
into wild-type recipients (Sppr’/~ BMT HOMER) 
to delete Spp/ in recruited macrophages (fig. S24, 
A and B). Wild-type recipients that received 
wild-type bone marrow and then underwent 
HOMER (C57BL/6 BMT HOMER) served as 
controls. Blood pressure and leukocyte counts 
were similar in both cohorts (fig. S24, C to E). 
Left atrial and ventricular volumes were pre- 
served and left ventricular hypertrophy was 
lessened in mice in which Spp1 was deleted 
(fig. S24F). Deleting Spp7 in bone marrow- 
derived cells reduced AFib inducibility and 
AFib burden (Fig. 4A). 

To outline mechanisms by which osteopon- 
tin exacerbates AFib, we investigated the cross- 
talk between SppI-expressing macrophages 
and other atrial cells, using CellPhoneDB (36). 
In HOMER mice and patients with AFib, 
paracrine and autocrine interactions were found 
between macrophage-derived SPP1 and cognate 
SPP1 receptors expressed by immune and 
stromal cells (Fig. 4B and fig. S25). In HOMER 
mice, CellPhoneDB analysis implied SPP1* 
macrophage interactions with client cells via 
three integrins (04), ayB3, and O98), CD44, 
and prostaglandin E, receptor 4 (EP,/PTGER4). 
These interactions, which included signaling 
to various macrophage and fibroblast clusters, 
suggested that SPP1 acts as a pleiotropic am- 
plifier for atrial inflammation and _ fibrosis, 
which in turn contribute to an AFib substrate (37). 

In support of this idea, the histology of 
atria collected from Sppr! ~ BMT HOMER 
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mice showed fewer CD68* macrophages when 
compared with that of C57BL/6 BMT HOMER 
mice (Fig. 4, C and D), indicating that SPP1 may 
influence atrial macrophage supply. Atrial mono- 
cyte numbers were unchanged in Sppr! ~ BMT 
HOMER mice (fig. S26A), making a role of SPP1 
in monocyte recruitment unlikely. TUNEL (ter- 
minal deoxynucleotidyl transferase-mediated 
deoxyuridine triphosphate nick end labeling) 
and CD68 staining of atria indicated that macro- 
phage death was also unchanged in the cohort 
in which SppI was deleted (fig. S26B). However, 
macrophage proliferation, which is up-regulated 
by HOMER conditions (fig. S21, D and E), de- 
clined in Sppr! ~ BMT HOMER compared 
with that in C57BL/6 BMT HOMER mice (fig. 
$26, C and D). Thus, SPP1 augments atrial 
inflammation through increased macrophage 
proliferation. 

Matrix production by fibroblasts increased 
in HOMER mice (S26, E and F), and immuno- 
reactive o smooth muscle actin (aSMA) stain- 
ing of HOMER atria documented an expansion 
of activated fibroblasts, which were reduced in 
Sppr’/- BMT HOMER mice (fig. $26, G to J). 
Atrial fibroblasts isolated from Sppr! ~ BMT 
HOMER mice expressed less collagen than 
those from C57BL/6 BMT HOMER (fig. S26K). 
Furthermore, we detected reductions in colla- 
gen I deposition and Masson’s trichrome stain- 
ing in atria of Spp! ~/- BMT HOMER mice (Fig. 
4, C and D), which points to reduced fibroblast 
matrix production if these cells do not sense 
SPP1. This interaction may be particularly re- 
levant because integrins—which, according to 
our CellPhoneDB analysis, fibroblasts use to 
sense SPP1—regulate fibroblast migration, pro- 
liferation, and activation, processes that pre- 
cede increased tissue fibrosis (38, 39). 

Comparing left atrial cells from Sppr! ~ 
BMT HOMER with those from C57BL/6 BMT 
HOMER mice by means of scRNA-seq (fig. 
$27, A and B), we identified fibroblast clusters 
that expressed collagens at high levels (fig. S27, 
Cto E). For these clusters, we pursued GSEA to 
examine through which pathways SPP1 acti- 
vates atrial fibroblasts (fig. S27F). Fibroblast 
clusters from C57BL/6 BMT HOMER mice up- 
regulated TgfbI gene sets. Inflammatory acti- 
vation of fibroblasts by SPP1 was supported by 
the up-regulation of inflammatory gene sets in 
fibroblasts from C57BL/6 BMT HOMER when 
compared with Sppr! ~ BMT HOMER mice 
(fig. S27F). Thus, SPP1 participates in inflam- 
matory fibroblast activation partially through 
transforming growth factor-B (TGF-B) path- 
ways. In atrial tissue from patients with AFib, 
SPP1 colocalization with collagen 1A1 expanded 
(fig. S8, C and D), lending further credence to 
SPP1 promotion of AFib through enhanced 
fibrosis. 

To ascertain whether the transcriptional 
changes that occurred in atrial fibroblasts of 
HOMER mice are representative of alterations 


7of8 


RESEARCH | RESEARCH ARTICLE 


in humans, we compared human and mouse 
fibroblast transcriptomes. We first performed 
GSEA (again using all C5 ontology MSigDB gene 
sets) on differentially expressed genes from a 
pseudo-bulk analysis comparing all fibroblast 
clusters isolated from atria of patients with 
AFib versus humans without AFib, as well as 
HOMER mice versus control mice. We then 
plotted the significance and direction of change 
for all gene sets for both species. Both GSEA 
data sets correlated strongly (Spearman’s p = 
0.55, P < 2 x 107"*) with gene sets related to 
immune response and collagen organization 
enriched in diseased atria (Fig. 4E and data S9). 
This pathway-level interspecies correlation fur- 
ther bolstered the physiological relevance of the 
HOMER model while also underscoring inflam- 
mation and fibrosis as central processes in AFib. 


Discussion 


In this study, we comprehensively assayed 
atria from patients and mice and documented 
large-scale shifts in the cellular frequencies 
and phenotypes during AFib. Macrophages 
expand more than any other cell type in AFib. 
Experiments in mice, in which either Ccr2 or 
Spp1 was deleted, demonstrated that targeting 
macrophages strengthens resilience against 
AFib (fig. $28). 

In human and mouse atria, SPP] was among 
the top up-regulated macrophage genes. This 
matricellular protein, which increases in the 
blood of patients with AFib (40), stabilizes 
collagen (41, 42) and promotes fibrosis in 
hypertension (43, 44). We speculate that 
macrophage-derived SPP1 (45) augments atrial 
tissue heterogeneity and hinders conduction 
between cardiomyocytes. SPP1 signals to most 
other atrial cell classes, enhancing inflamma- 
tion and fibrosis through interaction with 
various cell surface receptors. In the ventricu- 
lar myocardium and in other organs, such 
signaling promotes myeloid cell activation 
and migration as well as extracellular matrix 
production by fibroblasts (13, 15, 43, 44). Thus, 
targeting SPP1 may have several distinctive 
effects that together preserve atrial conduc- 
tion. The expanding numbers and altered 
phenotypes of atrial macrophages may have 
other disease-promoting consequences. For 
example, a loss of resident macrophages’ relation- 
ships with cardiomyocytes (46, 47) may com- 
promise atrial conduction. 

Going forward, therapeutic strategies could 
target recruitment of inflammatory macro- 
phages (CCR2) or macrophage-derived signals 
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(SPP1). Macrophage-targeted therapy for atrial 
cardiomyopathy is most likely to succeed with 
concomitant normalization of risk factors—for 
example, through surgical valve repair, weight 
loss, and blood pressure management. Our data 
centrally position macrophages as targets for 
immunotherapy in patients with AFib. 
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WORKING LIFE 


By Ananya Sen 


242 


At my own pace 


ell me your names and which year of graduate school you're in,” the visiting speaker said. Our 
department was hosting them to give a guest seminar, and six of us graduate students had been 
chosen to accompany them for lunch. I was looking forward to talking science and learning about 
the speaker’s career, but I dreaded this question. My discomfort grew as everyone else gave their 
answers. When my turn came, I swallowed and said I was in my sixth year. “I think if someone 
takes more than 5 years the student is the problem,” the speaker replied tersely. I was devastated. 


Right from the start of grad school, 
I had been aware that the require- 
ments of my department and ad- 
viser might make my Ph.D. take 
longer than average. Per my depart- 
ment, students need to publish two 
first-author papers to graduate. My 
adviser requires students to take 
the lead on all experiments so they 
have full ownership of their work 
and develop an arsenal of scientific 
techniques. I knew this going in. 
Although I didn’t relish the thought 
of getting by on a meager graduate 
student salary for longer than nec- 
essary, it wasn’t a dealbreaker for 
me. I was comfortable with the po- 
tential trade-off of taking longer to 
finish in order to become an inde- 
pendent scientist and pursue the re- 
search that excited me. My adviser 
had a list of projects he wanted to 
tackle and I chose one to work on, knowing I could draw on 
his expertise and support from lab mates as I went. 

But as the years passed, I began to worry. I investigated 
10 different hypotheses—just to prove each a dead end. We 
didn’t want to publish papers that raised more questions than 
they answered. Nor did we want to abandon the project; as 
we eliminated each hypothesis, we felt we must be getting 
closer to the answer. So, each time the only option was to for- 
mulate the next likely hypothesis and start over in a different 
direction. But it seemed I was getting nowhere closer to grad- 
uating. The lack of tangible progress was wearing me down, 
and the raised eyebrows whenever I told anyone how long 
Thad been in graduate school did nothing to lift my spirits. 

The speaker’s comment was the final straw. I went to my 
adviser and broke down. Didn’t the speaker understand how 
hard it was to conduct research? Why were other scientists 
judging me for how long I had been in graduate school? Why 
did I always need to apologize for it? 

My adviser was kind and supportive, as I had known he 
would be. He pointed out that over the past 6 years I had 


“No one’s Ph.D. journey is the 
same, and no one deserves 
to be shamed for how long it takes.” 


mastered many difficult techniques. 
My growing expertise meant no 
one could suggest an experiment I 
hadn’t already done. If I didn’t have 
the answer to my research question 
yet it was because the answer was 
complex, not because of personal 
shortcomings. My adviser had been 
working on this research problem 
for more than 3 decades, after all. 

I was slow to stop blaming my- 
self. But eventually, I saw what he 
was saying. I had done everything 
I could to answer my research 
question—running experiments in 
parallel to attack multiple hypoth- 
eses more efficiently, working late 
nights, and coming to lab before 
sunrise to set up experiments. I 
should be proud of my hard work 
and thorough investigation of many 
promising but ultimately unfruitful 
directions. I had nothing to apologize for. 

My new perspective helped calm me when other students 
in my cohort graduated before I did. It helped me ignore the 
jabs and explain why my Ph.D. journey was taking longer. It 
helped me push forward with determination and confidence. 
And eventually, thanks to that hard work and persistence, 
and my adviser’s continued guidance and support, we hit on 
a hypothesis that panned out. Seven-and-a-half years after 
starting my Ph.D., I graduated. 

Toward the end, I realized many graduate students were in 
the same boat as I was, hiding in their labs and not talking to 
anyone just to avoid answering how long they had been in the 
program. I wish we hadn’t been made to feel that way. I wish I 
could tell them—and those who made them feel bad—what I 
came to learn: No one’s Ph.D. journey is the same, and no one 
deserves to be shamed for how long it takes. 


Ananya Sen is a science writer for the Carl R. Woese Institute for 
Genomic Biology at the University of Illinois Urbana-Champaign. 
Send your career story to SciCareerEditor@aaas.org. 
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new products: sample preparation 
specifically for culturing spheroids. The 


BIOFLOAT surface is anti-adhesive, which 


repels cells of all types, preventing them from attaching to the 
surface of the plate. This, along with the round bottoms of the wells, 
results in cells growing into 3D spheroids. The BIOFLOAT surface 
enables rapid spheroid formation—between 8 and 24 hours— 

with very high circularity and reproducibility compared to other 
surfaces. The chemical composition of the BIOFLOAT surface is very 
strong, which allows it to withstand multiple washing steps and/ 

or mechanical action from pipetting. Additionally, the BIOFLOAT 
surface has proven to be very reliable, even when using difficult 
cells. In fact, many cell lines and some primary cells have already 
been successfully tested for spheroid formation on the BIOFLOAT 
surface. 

SARSTEDT 

For info: 1-800-257-5101 

www.sarstedt.com 


Ultra-Low Adherence Cell Culture Plate 
SARSTEDT introduces the BIOFLOAT 96- 
well Ultra-Low Adherence cell culture plate 


Tissue-Clearing Reagents for Plants 

Tokyo Chemical Industry offers Tissue Clearing Reagent iTOMEI D 
and Tissue Clearing Reagent iTOMEI M, which enable transparency 
of plant organs and clear detection of fluorescent protein in 3D 
imaging and are applicable in many plant species, such as Oryza 
sativa, Arabidopsis thaliana, and Marchantia polymorpha. Tissue 
Clearing Reagent iTOMEI D is a decoloring solution to elute stains of 
plant organs like chlorophyll, and it enables transparency of plants 
by simple osmosis in just a few days. The main advantage of Tissue 
Clearing Reagent iTOMEI D is that it can suppress autofluorescence 
in plant organs while retaining the fluorescence of fluorescent 
proteins, such as GFP and tdTomato, coexpressed as reporter 
genes. Meanwhile, Tissue Clearing Reagent iTOMEI M is a mounting 
solution suitable for observation using silicon lens, which adjusts the 
refractive index in the plant organ to 1.40. 

Tokyo Chemical Industry 

For info: +81 (0)3 5640 8878 

www.tcichemicals.com 


FDA-Certified Tubing Solutions 

Biotech Fluidics has launched a range of traceable, certified PEEK, 
and fluoropolymer-flexible tubing for use in regulated environments. 
In many medical and life science applications, biocompatibility 
requirements from regulatory standards including ISO 10993 
(medical device and dental materials), US Pharmacopeia Class VI 
(toxicity), FDA title 21, and EU Regulation 1907/2006 (DEHP content) 
strictly govern what flexible plastic tubing can be used. To serve 
these medical and life science applications, Biotech Fluidics is now 
able to offer high quality, FDA-certified flexible plastic tubing in 

bulk and pre-cut quantities. In addition to regulatory compliance, 
every supply of this new tubing comes with traceability information 
including manufacturing lot number. If required, Biotech Fluidics can 
also deliver sterilized tubing assemblies prepared in a clean room 
environment. 

Biotech Fluidics 

For info: +1-612-703-5718 

www.biotechfluidics.com/products/tubing 
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Microcentrifuge with Hydrocarbon Cooling 

Eppendorf offers Centrifuge 5427 R—the first Eppendorf 
microcentrifuge with hydrocarbon cooling to contribute to an 
even more sustainable laboratory environment. With this offering, 
customers can now perform various molecular and cell biology 
applications while using a refrigerated device that contains a natural 
cooling agent with a GWP of almost zero. Natural cooling agents, like 
R290 (propane), have a comparably low global warming potential 
(GWP) to CO, (<3), while conventional refrigerants such as R134a 
have a GWP of 1430 and thus have a disproportionately greater 
impact on global warming when released into the environment. The 
ACT label certification of Centrifuge 5427 R makes it even easier 

for customers to choose a more sustainable product. Due to its 
compact footprint and the dual-row rotor FA-45-48-11 for up to 48 x 
1.5/2 mL tubes, it is the optimal solution for laboratories with a high 
sample throughput. 

Eppendorf 

For info: +49 2232 418-0 

https://eppendorf.group/5427R 


2D-Barcoded Tube Whole Rack Reader 

Ziath announces a revolution in cryogenic sample management—the 
Ri-Track Mirage 2D barcoded tube whole rack reader. Traditionally, 
identifying the identity of frost-covered racks of 2D barcoded 

tubes taken from low-temperature storage has been a time- 
consuming and error-prone process. Drawing upon proprietary 
Radio Frequency Identification (RFID) tagging technology, the 
Ri-Track Mirage can reliably read 2D barcoded racks covered in ice 
and with racks taken directly from vapour-phase liquid nitrogen 
storage. The need to warm your rack to thaw it out—which may 
degrade thermally sensitive samples—to read your identity of 

your sample rack is eliminated. Extensive testing has shown that 
Ri-Track RFID technology can withstand immersion and retrieval 
from vapour-phase liquid nitrogen more than 10,000 times without 
losing any stored data or showing any physical degradation. Ri-Track 
is therefore suitable for all forms of traceable long-term cryogenic 
sample storage. 

Ziath 

For info: +1-858-880-6920 

www.ziath.com 


Lab Balance 

Weighing in a regulated environment can be complex, with stringent 
compliance directives and controls required to deliver the accuracy 
and reliability of results required by the pharmaceutical industry. 
One of the technologies helping scientists to achieve this level of 
accuracy is the Cubis II balance series from Sartorius. All weighing 
modules of the Cubis II balance are designed for intuitive operation, 
further aided by intelligent diagnostic systems. These design 
elements guarantee a higher degree of repeatability for different 
workflows while lowering the probability of human error during 
measurement. It offers modern user interfaces, pharmaceutical 
and GxP compliance, including data handling, data integrity, and 
connectivity, ergonomic sample handling, easy process integration, 
and unlimited communication at the highest level of accuracy and 
precision. 

Sartorius 

For info: +4-551-308-0 

www.sartorius.com 


Electronically submit your new product description or product literature information! Go to www.science.org/about/new-products-section for more information. 
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